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5 TRANSGENIC PROTEIN PRODUCTION 

Field of thP Invpntinn 

This invention relates to HSA-encoding DNA molecules, vectors 
10 containing same and HSA-producing transgenic mammals. 

Backnrnnnri of the Im/Pntlnn 

Human serum albumin (HSA) is a globular, non-glycosylated protein 

1 5 (MW 65,000) synthesized by the liver. Circulating in the blood stream at levels 
of 42 grams/liter, it is the most abundant serum protein. HSA is involved in a 
number of essential functions, including sustaining normal bloodstream 
osmolarity, regulating blood pressure and transporting fatty acids, amino acids, 
bile pigments and numerous small molecules. Clinically, HSA is used in large 

20 quantities to replace blood volume in acute phase conditions such as trauma 
and severe burns or surgical procedures. Currently, medical supply of HSA 
depends on the fractionation of donated human blood. At the present time, the 
cost of purifying HSA from blood is relatively low, since HSA as well as other 
blood products can be simultaneously purified from the same source. 

25 However, as other blood products, such as coagulation factors, are produced 
by biotechnology instead of purified from human blood, market dynamics will 
increase the relative cost of purification of HSA from blood. The threat of a 
diminishing supply of donated blood, rising costs of purifying HSA from blood 
and the potential risk of contamination with infectious viruses that cause 

30 hepatitis, AIDS and other diseases make an alternative source of production of 
large quantities of HSA desirable. As such, alternative approaches to the 
production of large quantities of HSA are required. 

Recombinant DNA technology has been used increasingly over the past 
35 decade for the production of commercially Important biological materials. To 
this end, the DNA sequences encoding a variety of medically important human 
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proteins have been cloned These include insulin, plasminogen activator, 
alpha-antitrypsin and coagulation factors VIII and IX. 

The expression of DNA sequences encoding these and other proteins 
5 has been suggested as the ideal source for the production of large quantities of 
mammalian proteins. A variety of hosts have been utilized for the production of 
medically important proteins including bacteria, yeast, cultured cells and 
animals. In practice, bacteria and yeast often prove unsatisfactory as hosts 
because the foreign proteins are often unstable and are not processed 

1 0 correctly. In bacteria, HSA is produced as an insoluble aggregate which 
requires processing to yield the mature, soluble protein. HSA has also been 
produced in the yeast Saccharomyces, but at low levels and with a high 
proportion being either fragmented, cell associated or insoluble (Sleep et aL, 
1990, Bio/Technology 8:42-46; Etcheveiry et aL, 1986, Bio/Technology 4:729- 

15 730; Quirk et aL, 1 989, Biotech. Appl Biochem. 1 1 :273-287). 

In light of this problem, the expression of cloned genes in mammalian 
tissue culture has been attempted and has, in some instances, proved a viable 
strategy. However, batch fermentation of animal cells is an expensive and 

20 technically demanding process. Transgenic animals have also been proposed 
as a source for the production of protein products. The production of 
transgenic livestock offers a number of potential applications including 
"Molecular Farming" (also referred to as Genetic Farming) where proteins of 
medical or commercial importance are targeted for high level expression and 

25 production in the mammary gland with subsequent secretion into the milk of 
such genetically engineered animals. The feasibility of this approach was first 
tested in transgenic mice. 

WO-A-8800239 discloses transgenic animals which secrete a valuable 
30 pharmaceutical protein, in this case Factor IX, into the milk of transgenic sheep. 
EP-A-0264166 also discloses the general idea of transgenic animals secreting 
pharmaceutical proteins into their milk. 

Early work with transgenic animals, as represented by WO-A-8800239* 
35 has used genetic constructs based on cDNA coding for the protein of interest. 
The cDNA will be smaller than the natural gene, assuming that the natural 
gene has introns, and for that reason is easier to manipulate. It is desirable for 
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commercial purposes to improve upon the yields of proteins produced in the 
milk of the transgenic animal. 

Brinster et al. (EMS 85 836-840 (1 988) ) have demonstrated that the 
5 transcriptional efficiency of transgenes having introns in transgenic mice is 
increased over that of cDNA. Brinster et al.. show that all the exons and introns 
of a natural gene are important both for efficient and for reliable expression 
(that is to say, both the levels of the expression and the proportion of 
expressing animals) and is due to the presence of the natural introns in that 

1 0 gene. It is known that in some cases this is not attributable to the presence of 
tissue-specific regulatory sequences in introns, because the phenomenon is 
observed when the expression of a gene is redirected by a heterologous 
promoter to a tissue in which it is not normally expressed. Brinster et al.. 
suggested that the effect is peculiar to transgenic animals and Is not seen in 

15 cell lines. However, Huang and Gorman (1990, Nucleic Acids Research 

18:937-947) have demonstrated that a heterologous intron linked to a reporter 
gene can increase the level of expression of that gene in tissue culture cells. 

The problems of yield and reliability of expression can not be overcome 
20 by merely following the teaching of Brinster et al.. and inserting into 

mammalian genomes transgenes based on natural foreign genes as opposed 
to foreign cDNA. First, as mentioned above, natural genes having introns are 
larger than the cDNA coding for the product of the gene since the introns are 
removed from the primary transcription product before export from the nucleus 
25 as mRNA. It is technically difficult to handle large genomic DNA. 

Secondly, the longer the length of manipulated DNA, the greater chance 
that restriction sites occur more than once, thereby making manipulation more 
difficult. This is especially so given the fact that in most transgenic techniques, 

30 the DNA to be inserted Into the mammalian genome will often be isolated from 
prokaryotic vector sequences (because the DNA will have been manipulated in 
a prokaryotic vector, for choice). The prokaryotic vector sequences usually 
have to be removed, because they tend to inhibit expression. So the longer 
the piece of DNA, the more difficult it is to find a restriction enzyme which will 

35 not cleave it internally. 
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Attempts to achieve protein expression utilizing cDNA encoding the 
protein instead of the full length gene, have generally resulted in low protein 
yields. A number of workers recognized the desirability of improving upon the 
yields and reliability of transgenic techniques obtained when using constructs 
5 based on cDNA. 

Archibald et aL (WO90/05188) noted that with certain proteins higher 
yields (than could be obtained utilizing cDNA) could be obtained when at least 
some of the naturally occurring introns were utilized. Palmiter et aL (1991 . 
10 Proc. Natl. Acad. Sci, USA 88:478-482) also found that the level of expression 
of a transgene was higher when the transgene included some introns as 
compared with the transgene composed of a cDNA. However, the level of 
expression with less then all of its natural introns was reduced when compared 
to the level of expression obtained with the entire gene with all of its introns. 

15 

Summary of the Invention 

The present invention provides DNA constructs comprising a promoter 
DNA sequence and a DNA sequence coding for human serum albumin. In one 

20 embodiment the human serum albumin sequence comprises at least one, but 
not all, of the introns in the naturally occurring gene encoding for the HSA 
protein. In another embodiment the DNA constructs comprise a 5* regulatory 
sequence which directs the expression and secretion of HSA protein in the 
milk of a transgenic animal. Preferably, the promoter gene is a milk protein 

25 promoter sequence such as B-lactogiobulin, whey acidic protein or B-casein. 
Most preferably the secreted protein is human serum albumin. The present 
invention also provides vectors comprising the constructs of the present 
invention. 

30 The DNA construct of the present invention encoding HSA comprises 

two contiguous exons encoding HSA and an HSA intron. In a preferred 
embodiment , the DNA construct of the present invention provides for 
expression of HSA in mammalian cells and milk at higher levels than the 
naturally occurring HSA gene or HSA cDNA. In a most preferred embodiment 

35 the DNA construct of the present invention comprises HSA exons and introns 
selected from the group consisting of introns 1-6, 7-14, 1+7-14, 1 + 2 +12-14, 2 
+1 2-1 4, 2 + 7-1 4 and 1 + 2 + 7-1 4. 
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In another embodiment , a DNA construct of the present invention 
encoding HSA comprises one but not all of the first 7 introns of the HSA gene, 
and one of the last 7 introns of the HSA gene. 

In another embodiment, a DNA construct comprising DNA sequences 
encoding human serum albumin under the control of a mammary tissue 
specific promoter, said DNA construct expressed by the mammary glands of a 
lactating female transgenic mammal is provided. 



The present invention also provides a transgenic mammal having 
incorporated into its genome a DNA construct comprising DNA sequences 
encoding human serum albumin operably linked to a mammary tissue specific 
promoter, said DNA construct expressed by the mammary glands of a lactating 
1 5 female transgenic mammal. Preferably, the promoter is the B-lactoglobulin 
protein promoter. 

The present invention also provides a method of making a transgenic 
mammal having incorporated into its genome a DNA construct encoding 
20 human serum albumin and a mammary tissue specific promoter, said DNA 
construct expressed by mammary glands of a lactating female transgenic 
mammal comprising providing a DNA construct containing the B-lactoglobulin 
promoter operably linked with nucleotide sequence encoding human serum 
albumin. 

25 

The present invention also provides a transgenic mammal which 
secretes HSA in the milk of lactating females. 

Other and further objects features and advantages will be apparent from 
30 the following description of the presently preferred embodiments of the 

invention, given for the purposes of disclosure when taken in conjunction with 
the accompanying drawings. 



Brief Description of tha Figures 
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The invention will be more readily understood from a reading of the 
following specification and by reference to the accompanying drawings forming 
a part thereof: 

5 Figure 1 is a map of the human serum albumin gene (1 A) and the 

sequence of the HSA gene (1B). 

Figure 2 is a graphical depiction of the DNA constructs. 
Figure 3 demonstrates a fluorograph of SDS PAGE from in vitro 
expression analysis. 
1 0 Figure 4 demonstrates a Southern blot analysis of transgenic mice 

carrying HSA constructs. 

Figure 5 demonstrates a dot blot analysis (5A) and a Western blot 
analysis (5B) of BLG expression in the milk of transgenic animals. 

Rgure 6 demonstrates a dot blot analysis for detection (6A) and 
1 5 quantitation (6B) and a Western analysis (6C) of HSA expression in the milk of 
transgenic animals. 

Rgure 7 demonstrates a non-denaturing gel analysis of protein in the 
milk of transgenic animals by Coomassie stain (7A) and Western blot analysis 
(7B) of HSA expression in the milk of transgenic animals. 
20 Rgure 8 demonstrates an HSA RNA analysis (Northern) of tissues of 

transgenic animals. 

Rgure 9 demonstrates in situ detection of HSA RNA. 
Figure 10 demonstrates a Western analysis of HSA expression by 
mammary explants of transgenic animals. 
25 Rgure 11 demonstrates the immunohistochemical detection of HSA and 

8-casein in mammary glands of virgin and lactating transgenic mice of strain 
#23. 

Figure 12 demonstrates the immunohistochemical detection of HSA and 
{J-casein in mammary glands of virgin and lactating transgenic mice of strain 

30 #69 

Rgure 13 demonstrates the immunohichemical detection of HSA 0- 
casein in mammary glands of virgin and lactating control non-transgenic mice. 

Rgure 14 demonstrates a Northern analysis of HSA, BLG and p-casein 
RNA expressed in the mammary glands of virgin, pregnant and lactating 
35 transgenic and non-transgenic control mice. 

Rgure 15 demonstrates (by metabolic labeling and 
immunopredpitation) the synthesis and secretion of HSA and BLG from 
mammary explants of virgin, pregnant and lactating transgenic mice. 
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Figure 16 demonstrates (by metabolic labeling and 
immunoprecipitation) the hormonal control of HSA and BLG synthesis and 
secretion from mammary explants, cultured over time, from virgin transgenics. 

Figure 17 demonstrates Western immunoanalysis of HSA secreted in 
5 milk of several transgenic mouse strains. 

Figure 18 demonstrates (by metabolic labeling and 
immunoprecipitation) expression and secretion of HSA from mammary 
explants as evaluated under several conditions. 

Figure 19 demonstrates a correlation between the levels of expression 
1 0 of HSA in the milk of transgenic strains and the levels of secretion of HSA from 
mammary explants from virgin females of these same strains. 

Figure 20 demonstrates a Western immunoanalysis of HSA secretion 
(upper panel) and BLG secretion, by Coomassie staining, (lower panel) by 
mammary explants of aborted transgenic goats. 

15 

Detailed Description of thp Preferred Fmhnriimpntg 
Definitions: 

20 

The term "naturally occurring HSA gene" means the DNA sequences 
which encode the HSA protein and includes exons and introns in their native 
positional relationships. The naturally occurring HSA gene has been 
sequenced and the sequence reported by Minghetti et aL, J. Biol. Chem. 

25 261 :6747-6757 (1986). As used herein, HSA base pair (bp) positions are 
related to this published sequence which is also shown on Figure 1B. The 
numbering system used herein is defined such that the first bp (A) of the HSA 
translational initiation codon (ATG) is numbered as bp 1776 which is bp 40 on 
the sequence shown on Figure 1 B. In the native state the HSA gene includes 

30 5' flanking sequences (including promoter sequences) which are responsible 
for initiation and regulation of transcription and expression and 3' flanking 
sequences, as used herein the term "naturally occurring HSA gene" need not 
include these flanking sequences. In the constructs of the present invention 
the native flanking sequences may be absent or substituted by a heterologous 

35 sequence. 
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As used herein, the term "intron" (also called intervening sequences) are 
those sequences of a naturally occurring gene which are included within the 
transcription unit of the gene but do not encode the natural gene product 
protein. The introns are transcribed into the precursor RNA, but are removed 
5 during the processing (splicing) of the RNA to its mature form, messenger RNA 
(mRNA). The introns are located between flanking exons. In this specification 
the term "intron" includes the whole of any natural intron or part thereof. 

As used herein the term "exon" refers to DNA sequences which are 
10 included within the transcription unit of the gene and maintained in the mature 
mRNA following processing and which encode the gene product protein. 
When an intron is deleted or removed from the naturally occurring gene, the 
two exons which naturally flank that intron become adjoined as contiguous 
exons. 

15 

The DNA constructs of the present invention will generally be suitable 
for use in expressing the HSA protein in mammalian cells and, preferably, in 
the mammary gland of a transgenic animal with subsequent secretion of HSA 
in the milk. The DNA constructs of the present invention comprise DNA 

20 sequences encoding the HSA protein together with 5* flanking regulatory 
elements which include promoter sequences. When expression in mammary 
tissue is desired the 5' regulatory sequences are chosen which directs the 
expression and secretion of HSA protein in the milk of a transgenic animal. 
Preferably, the promoter is a milk protein promoter sequence such as B- 

25 lactoglobulin, whey acidic protein or B-casein. When expression of the HSA 
encoding construct of the present invention in tissue culture cells is desired, an 
enhancer sequence may be included in the construct. Enhancer elements may 
be derived from SV40, human cytomegalovirus or any other source. The 
choice of enhancer will be known to one of skill in the art. The constructs of the 

30 present invention also comprise polyadenylation (poly A) signals and sites. 
The polyadenylation signal may be a homologous signal encoded by the 
native HSA gene or may be heterologous, for example, the BLG or SV40 poly 
A sites. The choice of promoter, poly A or other regulatory elements will be 
known to those of skill in the art. 

35 

The species of animals selected for expression is not particularly critical, 
and will be selected by those skilled in the art to be suitable for their needs. 
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Clearly, if secretion in the mammary gland is the primary goal, as is the case 
with preferred embodiments of the invention, it is essential to use mammals. 
Suitable laboratory mammals for experimental ease of manipulation include 
mice and rats. Domestic farm animals such as rabbits, cows, pigs, goats and 
5 sheep provide larger yields than other mammals. Preferably, sheep and goats 
are utilized because of the relative annual milk production in relation to 
generation time, experimental production time, and cost. 

According to another aspect of the invention, there is provided a vector 
1 0 (or DNA construct) comprising a genetic construct comprising at least one HSA 
intron and fewer than all of the HSA introns which vector when used to 
transfect a mammalian cell expresses HSA at a higher level of expression than 
the full naturally occurring HSA gene. 

1 5 According to another aspect of the invention, there is provided a 

mammalian or other animal cell comprising a construct as described above. 
According to a sixth aspect of the invention, there is provided a transgenic 
mammal or other animal comprising a genetic construct as described above 
integrated into its genome. It is particularly preferred that the transgenic animal 

20 transmits the construct to its progeny, thereby enabling the production of at 
least one subsequent generation of producer animals. 



25 



The DNA sequence of the naturally occurring HSA gene has been 
determined. Figure 1 demonstrates a map of the HSA gene and its sequence.. 



In order to make the DNA constructs of the present invention several 
different approaches were required. The ovine BLG gene was cloned from 
high molecular weight liver DNA as two EcoR I subgenomic fragments (5'-half 
approximately 4.3 kb and 3'-half approximately 4.4 kb) into the EcoR I site of 

30 lambda gt10 vector. The two halves were subcloned into pGEM-l, joined 
together at their EcoR I sites within the transcriptional unit, by using adaptor 
oligonucleotides which destroyed the EcoR I sites at the gene's 5'- and 3*-ends 
and which introduced Sal I sites at these positions. A unique SnaB I site was 
introduced into the Pvu II site within the 5'-untranslated region of exon 1. This 

35 results in vector p585 for the expression of Ji-lactoglobulin and contains 
approximately 3kb of 5'-flanking promoter sequences. Constructs are 
represented in Figs. 2 A and 2B. The BLG 5'-flanking sequences were 
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extended by cloning from ovine liver DNA a Hind III subgenomic fragment 
extending from the Hind III site within the transcriptional unit upstream 
approximately 8 kb to a 5*-Hind III site (p644 and p643) or a Sac I subgenomic 
fragment extending from the Sac I site within the original 5-flanking sequences 
5 upstream approximately 8.6 kb to another Sac I site (p646 and p647). The 
former constructs include approximately 5.5 kb of 5-flanking promoter 
sequences. The latter construct includes approximately 10.8 kb of these 
sequences. 

1 o An HSA cDNA was isolated from a lambda gt1 1 -human liver cDNA 

library. The cDNA contains the complete HSA coding sequence including the 
prepropeptide sequences as well as 20 bp of 5'-untranstated and 141 bp of 3'- 
untranslated sequence. The HSA cDNA was inserted into the Pvu II site within 
the untranslated region of BLG exon 1 in the same orientation as the BLG 

15 coding sequence resulting in vector p575. A DNA fragment encompassing the 
HSA genomic sequence including part of exon 1 , intron 1 , exon 2, intron 2 and 
part of exon 3 was produced by PCR using a ^-oligonucleotide primer which 
overlaps the native BstEII site in HSA exon 1 and a ^-oligonucleotide primer 
which overlaps the native Pvu II site in HSA exon 3 using high molecular 

20 weight DNA purified from human lymphocytes as a template. The cDNA region 
between the BstEII site in exon 1 encoding region and the Pvu II site in exon 3 
encoding region was replaced with the corresponding genomic fragment (2401 
bp) from the PCR product. This resulted in an HSA minigene possessing 
introns 1 and 2 in their native locations, as included in vector p607. In order to 

25 introduce the first intron of the HSA gene into the HSA cDNA we first 

introduced a Cla I site into the region of the HSA cDNA which is derived from 
HSA exon 2 by replacing a G with an A in the third base position of the codon 
for the 34th amino acid of the HSA protein including the prepropeptide, by in 
vitro mutagenesis. The altered codon encodes arginine as did the original. 

30 HSA intron 1 DNA with flanking exon sequences was generated by PCR using 
a 5-oligonucleotide primer which overlaps the native BstEII site in exon 1 and 
a 3-oligonucIeotide primer which overlaps, and contains, the Cla I site 
introduced into exon 2 encoding DNA. A clone of the genomic PCR product 
containing exon 1 through exon 3 sequences was used as template for PCR 

35 generation of intron 1 and flanking sequences. The cDNA region between the 
native BstEII site in exon 1 encoding region and the introduced Cla I site in 
exon 2 encoding region was replaced with the corresponding genomic 
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fragment from the PCR product. The resulting HSA minigene possesses intron 
1 in its native location as included in vectors p599, p600, p598, p643 and 
p647. 

5 The deletion of the BLG coding sequence was accomplished by deleting 

sequences between the introduced SnaB I site within the 5'-untranslated 
region of BLG exon 1 and the native Xma I site within the 3-untranslated 
region of BLG exon 7 prior to introduction of HSA sequences, as seen in 
vectors p600 and p607. This vector maintains most of the untranslated BLG 

10 exon 7 including its polyadenylation signal and site as well as sequences 3 V of 
the BLG transcription unit. The SV40 early gene (T and t) polyadenylation 
signal and site downstream of HSA sequences in vectors p598, p643 and 
p647 as well as other vectors was obtained from SV40 DNA by restriction with 
Bell at its 5'-end (SV40 map position 2770) and BamH I at its 3'-end (SV40 

1 5 map position 2533). In these vectors all BLG sequences downstream of the 
introduced SnaB I site in the 5-untranslated BLG exon 1 including coding 
sequence, polyadenylation signal and site and 3Mlanking sequences were 
deleted. 

20 In order to obtain HSA introns 3-14, the HSA gene was cloned from 

human placental DNA. Three Nco I sites within the HSA gene sequence were 
identified. The first Nco I site lies about 275 base pairs upstream of HSA exon 
1. The second site lies within exon 7 and the third site lies about 227 base 
pairs downstream of exon 15. Digestion of human high molecular DNA with 

25 Nco I released two fragments of 8079 and 9374 base pairs which together 

encompass the entire HSA gene. The 8079 base pair fragment represents the 
5'-half of the HSA gene while the 9374 base pair fragment represents the 3- 
half of the gene. These fragments were used to make 2 separate subgenomic 
DNA libraries. HSA clones from these libraries were identified using an HSA 

30 cDNA probe. A clone containing the 5 - half of the HSA gene to exon 7 was 
designated p650. A clone (p651) identified as containing sequences of the 3'- 
half of the HSA gene were found to have an internal deletion of HSA 
sequences. Clone p651, did, however, contain HSA sequences extending 
from the Asp I site within HSA exon 12 through the Nco I site downstream of 

35 HSA exon 15. 
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In order to clone the HSA gene sequences between exon 7 and exon 12 
so as to obtain that region which includes introns 7-1 1 , PCR technology was 
utilized. Four PCR reactions were set up using synthetic priming 
oligonucleotides homologous to desired regions of the HSA gene containing 
5 useful restriction sites. The PCR reactions were designed so that the upstream 
end of reaction #1 overlapped the Nco i site within exon 7. The downstream 
end of reaction #1 overlapped the Avr II site within intron 8. This same Avr II 
site was overlapped by the upstream end of PCR reaction #2. The downstream 
end of PCR reaction #2 overlapped the Hind III site within intron 8. This Hind III 

1 0 site was also overlapped by the upstream end of PCR reaction #3. The 
downstream end of PCR reaction #3 overlapped the Xhol site in intron 10 
which was also overlapped by the upstream end of PCR reaction #4. The 
downstream end of PCR reaction #4 overlapped the Asp I site in exon 12. By 
this PCR strategy the entire region desired was obtained and adjacent PCR 

15 products were joined together using overlapped restriction sites. The products 
of PCR #1 and #2 reactions were ligated together at their common Avr II site 
within HSA intron 8 and cloned into plasmid pGEM-1 resulting in construct 
p679. This construct contains HSA sequences extending from the Nco I site in 
exon 7 to the Hind III site in the downstream end of intron 8. The product of 

20 PCR #3 and #4 reactions were ligated together at their common Xhol site 
within HSA intron 10 and cloned into plasmid pGEM-2 resulting in construct 
p676. This construct contains HSA sequences extending from the same Hind 
III site in the downstream end of intron 8 to the Asp I site in exon 12. 

25 When taken together with construct p650, containing the HSA gene 

sequences from the Nco I site upstream of HSA exon 1 (HSA gene base pair 
position 1462) to the Nco I site in exon 7 (HSA gene base pair position 9541) 
and construct p651 , containing HSA gene sequences extending from upstream 
of the Asp I site in exon 12 (HSA gene base pair position 15592) to the Nco I 

30 site downstream of exon 15 (HSA gene base pair position 18915) these 
constructs, p679 and p676, complete the cloning of the entire HSA gene. 

In order to assess the contribution of introns to the level of expression of 
HSA in the milk of transgenic animals, constructs comprising the HSA exons 
35 and various combinations of introns were constructed. Details of the various 
constructions are given in the examples which follow. 
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The constructs of the present invention were tested for their ability to 
support expression of HSA protein in vitro in tissue culture cells and in vivo in 
transgenic animals. The in vitro expression of HSA by the constructs of the 
present invention is described in detail in Example 15. The natural in vivo 
5 regulation of expression of milk proteins under the control of the native 

promoters (e.g. BLG) is complex and requires the influence of hormones and 
specific cell-cell interactions. The BLG promoter is not usually active in tissue 
culture cells. In order to drive expression in the COS-7 cells chosen for these 
in vitro tests, an SV40 enhancer was introduced within the BLG promoter. 

1 0 Details of the construction of the constructs of the present invention having the 
SV40 enhancer are given in Example 15. A transient assay for HSA 
expression in COS-7 cells was used to test the constructs. Briefly, constructs of 
the present invention were transfected into COS-7 cells, incubated for 48-72 
hours. Expression of HSA was determined by metabolically labeling de novo 

15 synthesized proteins with 35 S-methionine and immunoprecipitating labeled 
HSA protein, which had been secreted into the media supernatants, with HSA 
specific antibodies. Precipitated HSA was analyzed by SDS-PAGE and HSA 
bands detected by fluorography. 

20 COS-7 cells transfected with a construct which contained HSA cDNA, 

lacking introns, (p658) expressed HSA at low levels. Expression of HSA 
protein was also relatively low even when some of the introns were included in 
the construct. However, selection of certain combinations of introns [p656 
(introns 1-6), p684 (introns 7-14), p695 (introns 2+12-14), p697 (introns 

25 1+2+12-14), p693 (introns 1+7-14), p692 (introns 2+7-14), and p698 (introns 
1+2+7-14)] supported expression of HSA protein at levels equal to or even 
greater than the full length HSA gene (p685 (introns 1-14; full length)]. In vitro 
analyses are shown in Figs. 3A and 3B and summarized in Table 1 . 
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Table 1 



in vniD oonsiruci 


noA inuons 
Included 


1 ouol nf In 

UCVCl KJl II 1 

Vitro 

ExDression 


HomolonniKi In 

Vivo Construct 


In Vivo Expression i i 

III VI WU UApiv9wlUll 1 1 

Transgenic Milk 












p615 


0 (cDNA) 
(BLG coding 
seq.) 

(BLG 3'-seq.) 


1 


p575 

(BLG coding 
seq.) 

(BLG 3'-seq.) 


0/8 


P608 


1 


2 


P598 


See below 


p606 


1 

(BLG 3'-seq.) 


2 


p600 

(BLG 3'-seq.) 


0/6 








p599 

(BLG coding 
seq.) 

(BLG 3*-seq.) 


0/5 


p610 


1 &2 

(BLG 3'-seq.) 


3 


p607 

(BLG 3'-seq.) 


4/6 

(1-35 wj/ml) 


* p658 


0 (cDNA) 


1 






* p659 


1 


2 


p598 


1 /5 

(2.5 mg/ml) 








p643 

(5.5 kb BLG 
5'-seq.) 


0/2 








p 647 (10.8 kb 
BLG 5'-seq.) 


1 /6 

(2ug/ml) 


* p691 


2 


3 






• p660 


1 &2 


3 


p607 

(BLG 3'-seq.) 


see above 


*p656 


1-6 


6 


p652 


4/12 

(0.002 - 10 
mg/ml) 








p654 

(5.5 kb BLG 
5'-seq.l 


0/5 


* P682 


12-14 


2 


p688 


In progress 


*p684 


7-14 


6 


p687 


3/7 (0.005-5 
mg/ml) 


* p685 


1-14 


6 


p686 


1/2 ( i iig/mi) 


• P694 


1+12-14 


4 






• 0695 


2 + 12-14 


5 






* p697 


1 +2 + 12- 14 


7 


p812 


1 (0.8-1 mg/ml) 
3 not yet 
determ. 


* 0693 


1 +7-14 


6 






* p692 


2 + 7-14 


8 


p696 


3/4 (0.002-2.5 
mg/ml) 


* P698 


1 +2 + 7-14 


8 







WO 93/03164 



15 



PCT/US92/06300 



In vitro constructs marked with (*) vary only In presence of HSA introns. (BLG 5'- 
sequences, SV40 enhancer and 3-SV40 poly A are the same) 

Except where indicated constructs lack the BLG coding sequences and BLG 3'- 
5 sequences including poly A. Unless indicated the SV40 poly A site was utilized. 

Unless indicated the 3kb BLG S'-flanking sequences (promoter) were utilized. 

In vitro expression level ranges 1 (low expression) to 8 (high expression) are 
semiquantitative comparisons where each increment represents several fold to 
1 0 many fold differences in level of expression. 

"Not yet detenu.' means that transgenic produced but presence of HSA in milk 
not yet determined. "Not yet quant." means that expression of HSA in the milk of 
the transgenic has been shown but the amount not yet quantitated. 

15 

Expression of HSA in tissue culture cells from these constructs 
demonstrated that the level of expression of HSA is modulated by the specific 
complement of HSA introns, i.e., the number of HSA introns present in the 
construct, the specific introns incorporated, the relative locations of introns, and 

20 the synergism between specific introns. Several fold higher levels of 
expression are obtained with constructs containing HSA minigenes with 
specific subsets of introns as compared with the entire HSA gene with all of its 
introns. Levels of expression of HSA are particularly high when supported by 
HSA minigenes which comprise one but not all of the first 7 introns of the HSA 

25 gene and one of the last 7 introns of the HSA gene. There are 5 Alu 

sequences (family of repeated DNA sequences) within HSA introns. Three of 
these Alu sequences are located within the first 7 introns and 2 are located 
within the last 7 introns. Intron 2 has 2 Alu sequences, introns 7, 8, and 11 
each have 1 Alu sequence. There appears to be an association between the 

30 presence of the Alu sequences within introns and the introns' positive effect on 
resultant levels of expression obtained with HSA minigenes. 

In vivo expression of heterologous protein in transgenic animals by the 
constructs of the present invention was assessed by injecting the constructs of 

35 the present invention into murine oocytes to produce transgenic mice. 

Transgenic mice were produced following the general methods described by 
Hogan et al... "Manipulating the mouse embryo: a laboratory manual" CSHL 
(1986). The details are further described In Example 16. Two different 
heterologous proteins, BLG and HSA, were expressed in the milk of the 

40 transgenic mice. Mice carrying the BLG or the BLG/HSA constructs of the 
present invention were detected by analysis of somatic DNA from the tails of 
newborn mice utilizing a 32p dna probe which recognizes both BLG and HSA 
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DNA sequences. Details of the assay are described in Example 16. In contrast 
to the in vitro assay, no enhancer was needed to drive the expression in 
transgenics of proteins under the direction of the BLG promoter. Female mice 
that were determined to carry the construct of the present invention, when 
5 mature were mated to non-transgenic males. Milk was collected from nursing 
transgenic females after parturition and the milk analyzed for the presence and 
amount of heterologous protein (BLG or HSA). Founder male transgenic mice 
were bred to non-transgenic females to produce female offspring to test for the 
expression of heterologous proteins in their milk. Heterologous protein 

1 0 expressed in milk was detected by a dot blot assay using antibodies to BLG or 
HSA protein. BLG expression in milk was detected by spotting the milk sample 
onto a nitro- cellulose fitter. Anti-BLG antibodies were then contacted with the 
filter. 125 l-Protein A was then contacted with the filter to bind quantitatively to 
the bound antibodies. HSA expression in milk was detected by spotting the 

15 milk sample onto a nitrocellulose filter, lodinated anti-HSA monoclonal 
antibodies were then contacted with the filter. The radioactivity was 
determined by autoradiography and correlated with standards by densitometry 
of the autoradiographs to quantitate the amount of heterologous protein 
expressed in the transgenic milk. 

20 

The capability of the BLG promoter to promote expression of BLG and 
HSA was tested both in vitro and in vivo. A 3kb DNA fragment upstream of the 
BLG transcription unit (i.e. the SMIanking region) was utilized in most of the 
constructs of the present invention. The efficacy of this 3kb promoter was 
25 tested by making transgenic mice utilizing construct p585 which contain the . 
BLG gene under the control of the 3kb BLG promoter. 

All transgenic mouse strains produced, carrying the native sheep BLG 
gene (construct p585), expressed BLG at high levels in the mammary gland 

30 and milk as determined by dot blot. An example is shown in Fig. 4. Levels 
ranged from 1 to 8.5 mg/rnl BLG (see Table 2). This is somewhat lower than 
the range (3-23 mg/ml) found by Simons et aL (1987, Nature 328:530-532) 
utilizing 4.3 kb BLG SMIanking sequences. However, no increase in BLG 
expression was observed with S'-flanking sequences of 5.5 kb (construct p644) 

35 or 10.8 kb (p646). These results demonstrate that the BLG 3kb 5'-f tanking 
sequences in the constructs of the present invention contain all the 5'-control 
elements necessary to direct high level expression to the mammary gland. 
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This 3kb BLG promoter fragment was also used in constructs comprising 
all or parts of the HSA DNA. This 3kb fragment proved sufficient to promote 
BLG and HSA expression both in vitro and in vivo. 

5 

The milks of lactating females from transgenic lines produced from 
BLG/HSA constructs were analyzed for the presence of HSA by immuno-dot 
blot using iodinated anti-HSA monoclonal antibodies. An example is shown in 
Fig. 5A. Our initial attempt to produce transgenic mice expressing HSA in their 

1 0 milk was by introducing the HSA cDNA into the S'-untranslated region of the 
first exon of the BLG gene of vector p585, resulting in vector p575. None of the 
8 lines secreted detectable levels of the human protein (Table 2). ft appeared 
that although our BLG vector was able to drive expression of its own BLG gene, 
it was unable to support the expression of the inserted HSA cDNA. Others 

1 5 have found that in transgenics, the levels of expression of heterologous genes 
under the control of a variety of 5'-regulatory elements (promoters) were 
increased by the incorporation of introns into the heterologous transcripts. 
Therefore, we tested a series of vectors in which the sheep BLG promoter was 
fused to HSA minigenes containing a variety of intron combinations. 

20 

Analysis of expression from one HSA minigene, containing HSA intron 
1, within 3 constructs (p599, p600, p598) demonstrated only 1 transgenic line 
(#23) out of 1£ produced expressing detectable levels of HSA in its milk. 
Inclusion of HSA intron 1 alone in the constructs of the present invention is not 
25 sufficient to obtain a high percentage of transgenic lines which express. As 
shown on Fig. 5B, female mice of line 23 secrete high levels, greater than 2 
mg/ml of HSA into their milk, and have stably transmitted this ability to their 
progeny for over two years. 

30 Mouse milk contains a significant amount of endogenous mouse serum 

albumin which co-migrates with human serum albumin in SDS-PAGE gels 
(data not shown). Immuno-detection assays demonstrated that the anti-HSA 
monoclonal antibody specifically detected the human protein and not the 
mouse, protein. The human and mouse proteins were also distinguishable by 

35 their distinct electrophoretic mobilities on native polyacrylamide gels. Milk from 
expressing line 23 clearly contains both the human (low mobility) and mouse 
(high mobility) albumins as seen by generalized protein staining wfth 
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coomassie (Fig. 6). The lower mobility band was confirmed to be HSA by 
native gel and immunoblot analysis (Rg. 6). 

The secreted HSA protein behaves in a manner indistinguishable from 
5 purified HSA or the HSA found in human milk in its electrophoretic mobility 
through native gels as well as in denaturing gels. In native gels the human 
protein migrates with a different mobility from endogenous mouse serum 
albumin. 

1 0 Reproducible expression of HSA in the milk of transgenics was 

achieved first only when the first 2 HSA introns were included in the construct 
(p607) with 4 of the 6 transgenic lines examined expressing HSA in their milk. 
Although an improvement in the frequency of expressing transgenics 
(penetrance), the levels of expression were disappointingly low (1-35 ug/ml). 

15 

A significant improvement in both the number of transgenics which 
express HSA in their milk (penetrance) as well as the levels of expression 
resulted from the inclusion of additional HSA introns or intron combinations 
within transgenic vectors. Four of twelve transgenic strains generated from 

20 vector p652 (HSA introns 1-6) express HSA in their milk with two strains 

expressing HSA above 1 mg/ml (one of which expresses at about 10 mg/ml). 
Three of seven strains generated from p687 (HSA introns 7-14) expressed 
HSA with one strain expressing at about 4-5 mg/ml. Three of four strains 
generated from p696 (HSA introns 2 + 7-14) expressed HSA with one strain 

25 expressing at about 2.5 mg/ml. Four transgenic strains have thus far been 
generated from vector p812 (HSA introns 1 + 2 + 12-14). Only one of these 
strains is old enough to have already been tested for the expression of HSA in 
milk and, in fact, this one strain does express high levels of HSA at about 0.8- 
1.0 mg/ml. Only two strains have been generated from p686 (HSA gene; i.e.. 

30 all HSA introns 1-14). One of the two expressed HSA but at low levels. 

The overall in vivo results correlate with the in yjim expression results. 
That is, extremely low levels of HSA are expressed from the cDNA construct in 
vitro and no HSA is detectably expressed from the cDNA construct inyjyjQ in 
35 transgenics. The inclusion of intron 1 into the in vitro construct results in 
slightly higher levels of in vitro expression as compared with the cDNA 
construct. Similarly, one transgenic strain derived from the transgenic 
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construct with HSA intron 1 did express. The inclusion of introns 1 and 2 in 
vitro also resulted in higher expression than the cDNA. These two introns in 
transgenics resulted in a much better penetrance then either intron 1 or the 
cDNA. The inclusion of the first 6 introns {1-6) or the last 8 introns (7-14) or all 
5 of the introns (1-1 4} within the in vitro constructs resulted in a markedly higher 
level of expression. These in vitro results correlated with in vivo results where 
about 33% (HSA introns 1-6) , 43% (HSA introns 7-14) and 50% (HSA introns 
1-14) of respective transgenic strains, generated from vectors with indicated 
HSA introns, express HSA in their milk and two or one strains, respectively, 

1 0 generated from vectors with introns 1 -6 or 7-1 4 express at levels greater than 1 
mg/ml. In addition, in vitro vectors possessing either HSA introns 2 + 7-14 or 1 
+ 2 + 12-14 support expression of even higher levels of HSA in vitro than the 
previous group of vectors. Similarly, 75% (3 of 4) of transgenic strains 
generated from a transgenic vector with HSA introns 2 + 7-14 and 100% (1 of 1 

1 5 thus far tested) generated from a vector with introns 1 +2 + 12-14 express HSA 
with one strain of each expressing high levels at about or above 1 mg/ml. 
While the number of transgenic strains thus far analyzed is limited, it appears 
that higher penetrance and/or levels of expression can be obtained in 
transgenics using vectors comprised of subsets of the introns of the HSA gene 

20 as opposed to the entire HSA gene with ail of its introns. The production of 
transgenic goats (Example 24) with constructs demonstrated to express high 
levels of HSA in the milk of transgenic mice or with constructs which are the 
transgenic counterparts of the in vitro constructs which support very high levels 
of HSA, should result in very high levels of expression in the milk of these dairy 

25 animals. 

In order to examine the tissue specificity of expression of HSA RNA total 
RNA was isolated from various tissues of transgenic female mice on day 10-12 
of lactation. RNAs were fractionated by electrophoresis, transferred to nylon 
30 membrane and probed with a 32 P-labeled HSA antisense RNA as described 
in Example 16. 

High levels of HSA mRNA were detected in the mammary gland of 
lactating females which secrete HSA into their milk (strain # 23) (Fig. 7). 
35 Transcripts are also found in skeletal muscle, kidney, liver and skin but not in 
other tissues (spleen, heart, salivary gland, lung and brain). The expression in 
liver appears to be endogenous mouse serum albumin . This ectopic 
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expression of transgene transcripts is not associated with any apparent 
physiological abnormality. Transgene transcripts from other genes fused to the 
5'-flanking sequences of the genes of milk specific proteins have been shown 
to accumulate in non-mammary tissues such as salivary gland, kidney and 
5 brain. 

The expression of HSA in RNA in the mammary tissue of transgenic 
strain #23 was also demonstrated by in situ hybridization as described in 
Example 16 and shown in Fig. 9. 

10 

The screening of transgenic mice for their expression of HSA in their 
milk was standardly performed on the milk of lactating females following 
parturition. The expression of HSA from explant cultures of mammary glands 
of both virgin and lactating transgenic strain #23 was also demonstrated 

15 (Example 17 and Fig. 9). In addition, we have shown a correlation between 
levels of expression of HSA in milk and levels of expression and secretion of 
HSA from mammary explants of virgin females among different strains of 
transgenic mice. Assay of explant cultures of mammary tissue of young 
transgenic goats or other form of dairy animals would greatly facilitate the 

20 identification of transgenic animals which will ultimately express HSA in their 
milk upon lactation. 

Having now generally described the invention, a more complete 
understanding can be obtained by reference to the following specific 
25 examples. These examples are provided for the purpose of illustration only 
and are not intended to be limiting unless otherwise specified. 

Example 1 

30 A. CLONING OF THE SHEEP B-LACTOGLOBULIN GENE 
WITH 5' AND 3-FLANKING REGIONS fX22-1, X10-1) 

A restriction map of the sheep B-lactoglobulin (BLG) gene (Simons et 
al... 1988, Bio/Technology 6; 179-183) indicated that the transcribed BLG 
35 sequences possessed only one EcoR I site and that EcoR I sites existed about 
3 Kb upstream of the transcription region within 5 -flanking sequences and 
about 1 Kb downstream of the transcription unit within 3'-flanking sequences. 
Restriction digestion of the BLG gene and flanking regions would therefore 
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release 2 BLG gene fragments; (1) a fragment (approximately 4.1-4.3 Kb) 
made up of 5'-flanking sequences, including the BLG promoter, and the first 
part of the BLG transcription unit (designated 5-region) and (2) a fragment 
(approximately 4.4 Kb) composed of the rest of the transcription unit including 
5 the polyadenylation site and 3-flanking sequence (designated 3-region). 
Therefore, the BLG gene and flanking regions were cloned as two gene halves 
made up of these two EcoR I subgenomic fragments. 

High molecular weight sheep liver DNA was digested extensively with 

1 0 the restriction enzyme EcoR I. Restriction fragments were resolved on a 0.6% 
agarose gel along with size marker DNA standards. In order to identify the 
location of the BLG fragments on the gel, an analytical vertical strip of gel, 
made up of the size markers and about 5% of the restricted sheep DNA was cut 
from the rest of the gel. stained with Ethidium Bromide (EtBr) and 

15 photographed under UV light. The rest of the gel, the preparative portion, was 
wrapped in plastic wrap and put at 4°C until ready for use. The analytical 
portion of the gel was subjected to Southern analysis using a 32 P-labeled 
probe produced by the random primer method, with a bovine BLG cDNA as 
template. The bovine cDNA clone used as probe was kindly supplied by Dr. 

20 Carl A. Batt, Cornell University. Only a single band was detected of molecular 
weight of approximately 4.3 Kb. This single band represents both the 4.3 and 
4.4 EcoR I subgenomic fragments which comigrated in this gel system. This 
comigration was verified in other Southern analyses using 4.3 and 4.4 Kb 
specific probes. The corresponding region of the preparative gel containing 

25 the BLG fragments was cut out of the agarose gel. DNA was electroeluted from 
this gel piece, purified by Eluptip-d -{Schleicher and Schuell)(i.e. gel and elutip 
purified) and ethanol precipitated. 

In order to isolate and clone the BLG fragments, the purified subgenomic 
30 DNA eluted from the gel was ligated into a Stratagene Iambda-gt10 vector 
which had been previously digested with EcoR I and dephosphorylated. 
Ligation products were packaged with Gigapack plus extracts (Stratagene) and 
approximately 500,000 plaques of the resultant subgenomic library plated out 
on C600 cells (Stratagene) and incubated for plaque formation. Plates were 
35 lifted, in duplicate, onto nitrocellulose filters and treated by standard 

procedures, niters were subsequently baked, prehybridized (5 x SSPE, 1/25 
Blotto, 0.2% SDS) and hybridized with the same buffer containing ^P-bovine 
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BLG cDNA probe at 62°C overnight. Filters were washed, the last most 
stringent wash was with 1 x SSC, 0.2% SDS at 62°C. Filters were 
subsequently subjected to autoradiography. Several duplicate positive 
plaques were identified and subplaqued to purity in Y1088 cells. Clones which 
5 contained either the 5-region or the 3'-region of BLG were identified by 
differential hybridization with either BLG exon 2 and 5-probes or exon 5 and 
3'-probes, respectively. A clone possessing the 5-region (5*-clone), 
designated JL22-1 , as well as a clone possessing the 3-region (y-clone), 
designated 3110-1 were utilized for subsequent subclonings. 

10 

B. SUBCLONING BLG 5' AND 3' REGIONS FROM X22-1 and X10-1 
PHAGE INTO pGEM-1 PLASMID (p570, p568) 

Recombinant phage (3122-1 and 3110-1) and DNA were purified using 
1 5 LambdaSorb® (Promega) using their protocols. Cloned BLG DNA were 

released from recombinant phage DNA by digestion with EcoR I. The released 
BLG 5'-region (approximately 4.3 Kb) from X22-1 DNA and BLG 3'-region 
(approximately 4.1-4.4 Kb) from MO-1 DNA were gel and elutip purified and 
ethanol precipitated. 

20 

Plasmid vector, pGEM-1 , was prepared for the subcloning of the BLG 5*- 
region by digestion with Pvu II and EcoR I, and the large vector fragment was 
gel and elutip purified and ethanol precipitated. A cloning adapter was 
prepared by the annealing of two synthetic oligonucleotides as shown. 

25 

Sail (EcoR I) 



5 * GTCGACGCGGCCGC 3 ' 

3 ' CAGCTGCGCCGGCGTTAA 5 ' 

30 

(NOTE: Sites in parenthesis are not cleavable.) 

The adapter was designed to allow the EcoR I compatible end to ligate 
to either of the EcoR I ends of the BLG 5-region insert and by doing so to 
35 destroy the EcoR I site at that ligation junction, as the adapter sequence varied 
from an authentic EcoR I sequence by the incorporation of a G following the 5- 
AATT instead of a C as would be found in the authentic sequence. Further, the 
adapter would link the adjoined BLG 5-region to the Pvu II site of the prepared 
pGEM-1 vector by ligation of the blunt Pvu II site and blunt end of the adapter. 
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A BLG EcoR I end which did not ligate to an adapter was free to iigate to the 
EcoR I site of the prepared pGEM-1 vector. The prepared pGEM-1 vector, the 
BLG 5-region insert fragment and the adapter were ligated in one step. The 
BLG DNA could ligate between the adapter and plasmid EcoR I sites in either 
5 of its two orientations. Ligation products were transformed into cpj DH5 
cells to ampicillin resistance. Resistant bacterial colonies were analyzed for 
the presence of plasmid containing BLG inserts by Whatman 541 filter lifts and 
hybridization with 32 P-labeled BLG exon 2 and 5' probes. Positive colonies 
were selected and grown in LB-amp medium. Plasmid DNAs were prepared 

1 0 from these isolates. Restriction analysis of clones with BamH I allowed for the 
selection of clones containing the desired orientation of BLG insert. BamH I 
digestion of plasmids with the desired BLG orientation resulted in DNA 
fragments of approximately 4200 and 2800 bp while DNA fragments from the 
non-desired orientation were approximately 5800 and 1380 bp. Restriction 

1 5 analysis confirmed that the desired clone contains the BLG 5'-region between 
the Pvu II and EcoR I sites of pGEM-1. The blunt end of the adapter is adjacent 
to the pGEM-1 Pvu II site. The Pvu II site is no longer cleavable at this location 
of the final clone. The 5'-end of the BLG 5'-region is linked to the EcoR I 
compatible region of the adapter. This EcoR I site is destroyed by the ligation. 

20 A Sal I site had been introduced with the adapter, just upstream of the S'-end of 
the BLG region. This site is not found within either the BLG 5' or 3-regions. 
The EcoR I site of the 3'-end of the BLG 5'-region linked to the pGEM-1 EcoR I 
site is regenerated and available for later digestion and ligation to the 5'-end of 
the BLG 3-region. The desired correct BLG 5-region construct (clone) 

25 selected for further work was designated p570. Clones with the BLG 

sequences in the undesired orientation were designated p571 and utilized in 
later constructions. 

The EcoR I restricted, purified 4.4 Kb BLG 3'-region DNA was subcloned 
30 into plasmid pGEM-1 between the plasmid BamH I and EcoR I restriction sites. 
pGEM-1 digested with BamH I and EcoR I was gel and elutip purified, and 
ethanol precipitated. A cloning adapter made from two annealed synthetic 
oligonucleotides had the following sequence; 

35 (EcoRI) BamHI 

5 ' AATTAGCGGCCGCG 3 ' 

3 ' TCGCCGGCGCCTAG 5 ' 
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As with the previous adapter, it's EcoR I compatible end allowed for ligation 
with either of the EcoR I ends of the BLG 3'-region but resulted in loss of the 
EcoR I site. In this case the adapter EcoR I compatible site was followed by a 
T/A bp as opposed to a G/C bp as would be found in an authentic EcoR I 
5 sequence. A three part ligation with prepared pGEM-1 (BamH I and EcoR I 
ends), the adapter (BamH I and EcoR I compatible ends), and the BLG 3'- 
region insert (EcoR I ends) was performed. The BamH I site of the adapter 
should ligate to the BamH I site of pGEM-1 . The BLG 3'-region would ligate 
between the adapter EcoR I compatible site and the EcoR I site of pGEM-1 in 

1 0 either orientation. Ligation products were introduced into EL soli DH5 cells by 
transformation to ampicillin resistance. Clones with the desired BLG 3-region 
orientation were characterized by BamH I generated fragments of 
approximately 4500, 2000 and 800 bp. Restriction analysis of desired clones 
confirmed that the EcoR I site at the 5'-end of the BLG 3-region was ligated to 

1 5 the EcoR I site of pGEM-1 and that this regenerated the EcoR I site; the EcoR I 
site at the 3'-end of the BLG 3'-region was ligated to the EcoR I compatible site 
of the adapter resulting in the destruction of this EcoR I site; the ligated BamH I 
sites of the adapter and pGEM-1 had regenerated a BamH I site; and that the 
Sal I site of the pGEM-1 polylinker was just downstream of the BLG 3-region 

20 (and adapter). The EcoR I site at the 5-end of the BLG 3'-region would be 
used later to join together the two BLG gene halves. A desired clone, 
designated p568, was selected for further work. 

C. CONSTRUCTION OF A COMPLETE BLG VECTOR WITH A SnaB I SITE 
25 REPLACING THE Pvu II IN BLG EXON 1 (p585 vector) 

In order to facilitate the cloning of foreign genes (such as HSA) into the 
correct Pvu II site of the BLG vector (within the untranslated portion of BLG 
exon 1), a SnaB1 site was introduced into this Pvu II site. No SnaB I site exists 
30 within natural BLG sequences or in the pGEM bacterial plasmid sequences in 
which the BLG sequences are cloned. Therefore, the introduced SnaB I site is 
unique, simplifying the introduction of foreign genes into the appropriate 
location of the BLG sequences. 

35 The 5* portion of the BLG gene (EcoR I subgenomic) cloned into pGEM- 

1 (i.e. plasmid p570) possesses three Pvu II sites including the appropriate site 
within BLG exon 1 . The other two Pvu II sites were mapped to locations 
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approximately 2100 bp and 2600 bp, respectively, upstream of the Pvu II site 
within exon 1. Plasmid p570 was partially digested with Pvu II and linearized 
plasmid (approximately 7200 bp) was gel and elutip purified. A synthetic SnaB 
I linker oligonucleotide of the sequence 5-GTACGTAC-3' was self annealed. 
5 Annealed linker was ligated with the purified linearized plasmid p570. Ligation 
products were transformed into ELcoJi DH5 cells to ampicillin resistance. 
Desired recombinants were identified by the presence of a SnaB I site, the 
absence of the Pvu II site in exon 1 and the presence of the two upstream Pvu II 
sites. The correct plasmid was designated p583. 

10 

The 5-half of the BLG gene with the SnaB I site replacing the Pvu II in 
exon 1 was recombined with the 3-half of the BLG gene as follows. Construct 
p583 (containing the 5'-BLG region) was digested with Pvu I ( within pGEM-1) 
and EcoR I (at the junction of the 3' end of the 5' BLG region and the adjacent 
1 5 pGEM-1) The DNA fragment (approximately 5800 bp) containing the 3-portion 
of the plasmid Amp resistance gene, the plasmid ori, and the 5-portion of the 
BLG gene was gel and elutip purified and ethanol precipitated. 

The plasmid containing the 3'-half of the BLG gene, p568, was similarly 
20 digested with Pvu I and EcoR I. The DNA fragment of approximately 5800 bp 
containing pGEM-1 sequences including the 5*-portion of the Amp resistance 
gene up to its Pvu I site and complementary to those in the fragment described 
above, and the 3'-half of the BLG gene up to its 5' EcoR I junction with pGEM 
was gel and elutip purified and ethanol precipitated. The two purified 
25 fragments were ligated together and transformed into £, aili DH5 cells to 

ampicillin resistance. Only correctljr recombined fragments would regenerate a 
complete ampicillin resistance gene. Correct recombination of a complete BLG 
gene was confirmed by restriction analysis by the release of the full length BLG 
region of approximately 8800 bp with restriction enzyme Sal I. This new 
30 plasmid, designated p585, is composed of a pGEM-1 bacterial plasmid and the 
complete BLG gene as described, with a SnaB I site substituting for the Pvu II 
site in exon 1 of BLG. 



35 



D. 



CLONING OF SHEEP GENOMIC SEQUENCES EXTENDING 
UPSTREAM OF THE 5'-BLG SEQUENCES IN PREVIOUS BLG 
VECTORS (p639. p642) 
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7he 5-sequences, upstream of the BLG coding sequence, within the 
previously discussed BLG vectors encompass a genomic DNA fragment of 
approximately 3 Kb. This 3 Kb region includes the BLG promoter sequences. 
In order to determine if sequences upstream of this 3 Kb fragment contain 
5 elements which would increase the consistency and/or level of expression from 
the BLG promoter, upstream genomic sequences were cloned. 

High molecular weight sheep liver DNA was subjected to Southern 
analysis utilizing a variety of restriction enzymes. The very 5'-end of the BLG 

10 sequences already obtained, i.e. the Sal I to Pvu II fragment (approximately 
450 bp) hybridized to repetitive sequences in genomic Southerns. Therefore, 
the Pvu Il-Pvu li fragment (approximately 500 bp) downstream of the Sal l-Pvu 
II fragment and the Pvu ll-BamH I fragment (approximately 600 bp) downstream 
of the Pvu Il-Pvu II fragment were used as a probe. These probe fragments 

15 were generated by digestion of p570 with BamH I and Pvu II, gel and elutlp 
purification of the appropriate 500 and 600 bp fragments, and ethanol 
precipitation. Probe DNA was 32 P-Iabeled by the random primer technique. 

Southern analysis revealed a probe positive Hind III fragment 
20 (approximately 8 Kb) and a probe positive Sac I fragment (approximately 8.6 
Kb). This allowed us to map Hind III and Sac I sites to approximately 2.5 Kb 
and 7.8 Kb, respectively, upstream of the 5-end (EcoR I site) of the previously 
obtained BLG sequences. 

25 High molecular weight sheep DNA was digested extensively with either 

Sac I or Hind III. Restriction fragments were resolved on 0.6% agarose gels. 
An analytical strip of each gel, made up of size markers and about 5% of the 
restricted sheep DNA was cut from the rest of the gel, stained with ethidium 
bromide and photographed under UV light. The rest of the gels, the 

30 preparative portions, were wrapped in plastic wrap and put at 4°C until ready 
for use. The analytical portions were subjected to Southern analysis using the 
32p-| a beled probe discussed above. The probe positive Hind III (8 Kb) and 
Sac I {8.6 Kb) fragments were identified. Corresponding regions were cut out 
of the preparative portions of the gels. DNAs were electroeluted, elutip isolated 

35 and ethanol precipitated. 
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Vector Lambda Zap II (Stratagene) was used for construction of 
subgenomic libraries for both the Hind III and Sac I generated fragments. For 
the Sac I subgenomic library, Lambda Zap II was digested with Sac I and 
dephosphorylated with CIP. For the Hind III subgenomic library, Lambda Zap II 
5 was first self ligated in order to ligate cos ends together. It was subsequently 
digested with Spe I and partially filled in with Klenow polymerase and dCTP 
and dTTP. 

The purified Hind III subgenomic DNA was partially filled with Klenow 

1 0 polymerase and dGTP and dATP. these partially filled Hind III ends were 
compatible with the partially filled Lambda Zap II Spe I ends. The Hind III 
subgenomic DNA was ligated into the Spe I digested X vector and the Sac I 
subgenomic DNA was ligated into the Sac I digested X vector. Each was 
packaged with Gigapack Plus II extracts (Stratagene) and plated on PLK-A 

15 cells (Stratagene). Plates were lifted, in duplicate, onto nitrocellulose filters. 
Filters were treated as discussed in the library screen for X22-1 and X10-1 . 
Filters were hybridized with the 32 P-labeled probe discussed earlier in this 
section. Filters were washed. Duplicate positive plaques were subplaqued to 
purity with the final, most stringent wash being with 0.5xSSC, 0.2%SDS, 62°C. 

20 The pBiueScript SK phagemid containing BLG sequences were in vivo 
excised from the Lambda Zap II positives clones using R408 helper phage 
(Stratagene) by the method recommended by the supplier. The phagemids 
were rescued on XL1-Blue cells (Stratagene) and selected on LB-ampicillin 
plates. Selected colonies were cultured in LB-ampicillin medium and plasmids 

25 from these cultures were subjected to restriction analysis. 

Correct clones of the Hind III subgenomic cloning were initially identified 
by the presence of an approximate 4.4 Kb EcoR I fragment and approximately 
1900 and 1075 bp Asp 718 (isoschizomer of Kpnl) fragments all of which had 

30 been previously delineated by mapping of the BLG region already cloned. 
Both orientations of the BLG Hind III subgenomic fragment cloned into the Spe 
I site of the vector were found. The desired orientation with the 5'-end of BLG 
Hind III fragment cloned into the Spe I site towards the plasmld multiple cloning 
Sal I site was characterized by the release of a fragment of approximately 2500 

35 bp upon digestion with EcoR I and Hind III. This desired clone of the Hind III 
subgenomic DNA was designated p639. The 5'-end of the BLG region was 
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extended approximately 2.5 Kb further upstream from the 5-end of the original 
BLG clone (e.g. p585). 

Correct clones of the Sac I subgenomic cloning were initially identified 
5 by the release of a fragment of approximately 2500 bp upon digestion with 
EcoR I and Hind III and a 3800 bp fragment upon digestion with Sac I and Hind 
111. The clones with the desired orientation of BLG sequences within the 
pBIueScript SK plasmid were identified as follows. Sac I subgenomic clones 
were digested with EcoR I and subjected to Southern analysis using the 32 P- 

1 0 labeled p570 BamH l/Pvu II probe homologous to the 5'-end of the p570 BLG 
sequences and therefore to the 3'-end of the BLG sequences within the Sac I 
subgenomic fragment described above. A probe positive EcoR I fragment of 
approximately 4.2 Kb identified the plasmid as being the desired orientation 
with the upstream most end of the Sac I fragment being next to the plasmid 

15 multiple cloning site including the Sal I site. This correct, desired clone was 
designated p642. 

Sequencing confirmed that p639 (Hind III subgenomic) and p642 (Sac I 
subgenomic) contained BLG upstream sequences which overlapped the 

20 original BLG clones (e.g., p571). The S'-sequence of the original BLG 5- 
portion cloned in p571 was determined using an Sp6 promoter primer of the 
sequence, 5' ATTTAGGTGACACTATA 3\ The sequence was further 
extended into the BLG sequences of p571 by using a primer, 
5 l TGTTTGGGGACTTCCCTGGTGA 3', derived from sequence obtained using 

25 the Sp6 promoter primer. 

The sequences obtained using the second primer were identical in the 
p571 r p639 and p642 construction confirming that the latter two clones 
contained BLG upstream sequences. In addition a third primer, 
30 3' AGTCCCACTACGACCGGAG 5\ derived from sequence obtained from the 
Sp6 promoter primer was used to obtain sequence upstream of the 5 - EcoR I 
site in the original BLG clone. As expected sequences obtained from both 
p639 and p642 were identical and the proximal most 25 bases were identical 
with the 5-most bases of p571 and contained the natural EcoR I site. 

35 

E. CONSTRUCTION OF COMPLETE BLG VECTORS WITH BLG CODING 
REGIONS WITH EXTENDED S'-SEQUENCES (p644, p646) 
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The 5-extended BLG sequences cloned as Hind III and Sac I sheep 
DNA subgenomics in plasmids p639 and p642, respectively, were 
incorporated into full length vectors capable of expressing BLG as follows. 

5 

For the incorporation of sequences contained within the Hind III 
fragment, the S'-extension was first recombined with the 5-BLG region in 
plasmid p590. p590 was digested to completion with Asp 718 and partially 
with Sal I. The DNA fragment of approximately 4940 bp which included BLG 

1 0 sequences downstream of the Asp 71 8 site through the Sal I site and adjacent 
pGEM sequences was gel and elutip purified and ethanol precipitated. The 5'- 
end of the BLG Hind III fragment (approximately 4600 bp) was released from 
p639 by digestion with Sal I (in the polylinker, just upstream of the 5'-end of the 
Hind III fragment), and Asp 718 (downstream of the original BLG 5'-EcoR I site), 

15 gel and elutip purified, and ethanol precipitated. The prepared p590 and p639 
fragments were ligated together. Ligation products were transformed into JL 
CttJi DH5 cells to ampicillin resistance. Correct recombinants were diagnosed 
by the fact that restriction by EcoR I produced two fragments (approximately 
7090 and 2500 bp). Sal I produced two fragments (approximately 6758 and 

20 2835 bp ) and Bgl II linearized plasmid to the size of approximately 9590 bp. 
Correct clones were designated p640. Plasmid p640 contains a BLG 5-region 
of approximately 5.5 Kb from the upstream Hind III site to the transcriptional 
initiation site just upstream of the SnaB I cloning site. This 5.5 Kb 5'-region 
was recombined with complete BLG coding and 3-regions. Plasmid p640 was 

25 digested with Pvu I (within the pGEM) and SnaB I releasing a DNA fragment of 
approximately 7043 bp made up ofpart of the pGEM vector and the 5.5 Kb BLG 
5-region. Plasmid p585 was also digested with Pvu I and SnaB I releasing a 
DNA fragment (approximately 7041 bp) comprised of the rest of pGEM and the 
BLG coding and 3'-regions. These fragments were gel and elutip purified and 

30 ethanol precipitated and subsequently ligated together. Ligation products 
were used to transform DH5 cells and positive transformants selected on LB- 
ampicillin plates. Hind III digestion resulting in 3 DNA fragments 
(approximately 7840, 3410 and 2830 bp) identified correct recombinant clones 
consisting of 5'-BLG sequences (approximately 5.5 Kb), BLG coding 

35 sequences and 3'- BLG sequences. Correct clones were designated p644. 
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The 5'-BLG sequences contained within the sheep DNA Sac I clone, 
p642, were incorporated into a full length BLG vector by first joining these 
sequences to the 5'-BLG sequences contained within plasmid p640 which 
possesses 5'-BLG sequences derived from the Hind III clone p642. Plasmid 
5 p640 was digested with EcoRV which restricted it at 2 sites. One EcoRV site 
was contained within the pGEM polylinker just upstream of the 5'-end of the 
Hind III BLG sequences. The second EcoRV site was just upstream of the 
original 5-BLG EcoR I site. The resultant DNA fragment of approximately 7150 
bp was gel and elutip purified and ethanol precipitated. The ends of this 

10 fragment were then dephosphorylated with calf intestinal alkaline phosphatase 
(CIP) (Promega). Into this dephosphorylated EcoRV site was ligated BLG 
sequences which extended from the common EcoRV site just upstream of the 
original y-EcoR I site up to the EcoRV site. This latter DNA fragment 
(approximately 7700 bp) was obtained by digestion of p642 with EcoRV and 

1 5 subsequent gel and elutip purification and ethanol precipitation. Ligation 

products were transformed into DH5 cells and positive transformants selected 
on LB-ampicillin plates. Clones with the correct, desired orientation of the 
p642 insert into p640 were characterized by the presence of two Bgl II DNA 
fragments (approximately 10,140 and 4710 bp). A Bgl II site had been mapped 

20 to about 600 bp downstream of the 5'-Sac I site by Southern analysis of sheep 
DNA. Correct clones were designated p645. p645 clones contain 5 f -BLG 
sequences of approximately 10.8 Kb upstream of the BLG transcriptional 
initiation site, followed by the SnaB I site, the 3*-BLG region containing the 
native BLG polyadenylation signal and site and the pGEM bacterial plasmid. 

25 p645 does not contain BLG coding sequence. p645 was used to produce a full 
length BLG vector with 10.8 Kb 5-BLG sequences. p645 was digested with 
SnaB I and Pvu I (within pGEM). The DNA fragment of approximately 12,290 
bp, made up of bacterial plasmid sequences as well as all 10.8 Kb 5 1 - 
sequences upstream of the SnaB I site, was gel and elutip purified and 

30 ethanol precipitated. The BLG coding sequence and BLG 3-sequences as 
well as bacterial plasmid were obtained as a SnaB I, Pvu I DNA fragment 
(approximately 7040 bp) from p585 similarly prepared by gel purification. 
These two DNAs were ligated together and transformed into E^cali DH5 cells 
to ampicillin resistance. Correct recombinants produced DNA fragments of 

35 approximately 14,040, 2860 and 2430 bp upon Xba I digestion and were 
designated p646. 
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The 2 purified Nco I subgenomic fractions were individually ligated into 
Lambda Zapll vector, digested with EcoR I and dephosphorylated with alkaline 
phosphatase (Stratagene, Inc.) using 2 synthetic oligonucleotides which when 
5 annealed form an adaptor of the structure: 

T WmT FlnoR I 

Oligonucleotide A: 5' CATGGCAACGATCGCGAGTCGACG 3' 

10 Oligonucleotide B: 3' CGTTGCTAGCGCT£A££I£CTTAA 5' 

Sal I 

Prior to annealing the 5'-end of oligonucleotide B was phosphorylated 
using T4 polynucleotide kinase by standard procedures, so as to supply a 5- 

1 5 phosphate group required for the ligation of the EcoR I site of the adaptor to the 
dephosphorylated EcoR I site of the lambda vector. The S'-phosphate group of 
the Nco I site of the subgenomic human DNA fractions allows ligation of this 
site to the nonphosphorylated Nco I site of the adaptor. Ligation products were 
packaged into phage using Gigapack II Plus packaging extract (Stratagene, 

20 Inc.) according to the suppliers protocols. 

Libraries were produced by the adsorption of packaged phage into PLK- 
A cells (Stratagene, Inc.) and plated. Approximately 100,000 plaques of the 
library produced from the Nco I subgenomic fraction containing the HSA 8079 

25 bp fragment and 50,000 plaques of the library produced from the Nco I 

subgenomic fraction containing the HSA 9374 bp fragment. Plates were lifted 
in duplicate onto nitrocellulose filters. Fitters were treated by standard 
procedures, prehybridized (5 x SSPE, 1/25 Blotto, 0.2% SDS) and hybridized 
overnight at 62°C with the same buffer containing the 32 P-labeled HSA cDNA 

30 probe discussed above. Filters were washed and subjected to 

autoradiography. Duplicate positive plaques were identified and subplaqued 
to purity. The pBluescript SK plasmid component of the Lambda Zapll vector, 
containing HSA gene inserts, were in vivo excised from the vector using the 
R408 helper phase supplied by Stratagene, Inc. according to their protocols. 

35 Plasmids containing the 5'-half of the HSA gene were designated p651. 
Piasmids containing the 3-half of the HSA gene were designated p650. 

Restriction analysis of p650 confirmed that this clone in fact represented 
the insertion of the Nco 1 5*-half of the HSA gene into the EcoR I site of 
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pBluescript SK. Digestion of p650 with the following restriction enzymes 
yielded the expected DNA fragments. 

Restriction Enzyme Expected & Observed DNA Fragments (bp) 
5 Bglll 5363; 4921 ; 754; 47 

BstEII 11085 

EcoR I 3541 ; 2958; 2008; 1 603; 709; 266 

Hind III 6564; 4281 ; 240 

Nco I 8079; 3006 

10 Nco I + BstEII 7756; 3006; 323 

Seal 251 1 ; 2449; 2405; 2368; 1 352 

Xba I 6926; 1882; 1296; 981 

The expected locations of the restriction sites were derived from a 
1 5 restriction map of pBluescript SK (Stratagene, Inc.) and the SEQ restriction 
analysis program of Intelligenetics, Inc. of the sequence of the appropriate 
region of the HSA gene as published by Minghetti et aL (1986, J. Biol. Chem. 
212:6747-6757). The restriction analysis of p650 also determined that the 
HSA gene region was cloned into the pBluescript vector with its 5'-end toward 
20 Kpnl site of the vector multiple cloning site. 

It had been expected that the restriction of p651 with Nco I would result 
in 2 fragments of 9374 and 3006 bp; the former representing the Nco I 3'-half of 
the HSA gene and the latter representing the pBluescript sequences into which 

25 the HSA sequences were cloned. However, while the 3006 bp (pBluescript) 
band was visible, the 9374 bp band was not found. Instead a band of 
approximately 4000 bp was present. In order to determine whether this band 
represented HSA sequences, p651 was subjected to sequencing using 
several sequencing primers (synthesized in house) using either the 

30 Sequenase Sequencing kit (United States Biochemical Corp.) or the 

"^Sequencing kit (Pharmacia) according to manufacturer's protocols. The 
EcoR I site within pBluescript into which the cloned sequences were introduced 
is flanked by a T7 promoter on one side and a T3 promoter on the other side. 
We therefore used T7 (sense) and T3 (antisense) sequencing primers in order 

35 to sequence the termini of the cloned sequences. HSA exon 7, 8, 9, and 1 1 
specific primers (sense) as well as two exon 12 specific primers (sense) and an 
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exon 15 specific primer (antisense) were also used. Primer sequences were 
as follows: 



Primer 



Sequence 



Corresponding 
HSA gene 
bp position# 



T7 5' AATACGACTCACTATAG 3' 

T3 3' GAAATCACTCCCAATTA 5' 

1 0 HSA Exon 7 5' CATGGAGATCTGCTTGAA 3' 

HSA Exon 8 5' GACTTGCCTTCATTAGCT 3' 

HSA Exon 9 5' GAGAAGTGCTGTGCCGCT 3' 

HSA Exon 1 1 5' GTACCCCAAGTGTCAACT 3* 

HSA Exon 1 2(A) 5' GACAGAGTCACCAAATGC 3' 

1 5 HSA Exon 1 2 (B) 5' GAGAG ACAAATCAAGAAAC 3' 

HSA Exon 1 5 3' AGTCGGATGGTACTCTTATTCTC 5* 



9541 - 9558 
10996- 11013 
12566- 12583 
15002- 15019 
15588- 15605 
15735- 15753 
18522- 18544 



20 



Specific sequence information from all primers except the HSA exon 8,9 
and 11 specific primers was obtained suggesting that these latter regions were 
lacking in p651 . Sequences obtained from sequencing reactions using the 
primers corresponded to HSA gene sequences as indicated below. 



25 Primer 

T7 
T3 



30 



35 



40 



Sequence obtained from primers corresponding to 
HSA gene hp position and region 



HSA exon 7 
HSA exon 12(A) 
HSA exon 12(B) 
HSA exon 15 



9540 - 9737 
18919- 18669 

9605 - 9616 
15681 - 15690 
15796-15807 
18497-18156 



(Part of exon 7 and into intron 7) 

(3-flanking region of exon 15 and 

into exon 15) 

(Intron 7) 

(Exon 12) 

(Intron 12) 

(Intron 14) 



Clearly, the 5 - and 3-ends of the 3-half of the HSA gene, including Nco 
I sites at each end, as well as exon 7 and at least part of intron 7 from the 5*- 
end and the 3'-sequences flanking exon 15 and exon 15 through exon 12 from 
the 3 -end. These results demonstrate that the 3'-half of the HSA gene was 
present in p651 , but that an internal deletion of HSA sequences had occutxed. 
The deletion was also found in the parent Lambda phage. Therefore, construct 
p651 contains the HSA gene from exon 12 through exon 15 and beyond into 
3'-f tanking sequences to the Nco I site, including introns 12, 13 and 14. 
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Restriction analysis of p651 demonstrated that the Asp I site within exon 1 2 
was present. 

In order to clone the HSA gene sequences between exon 7 and exon 12 
5 so as to obtain that, region which includes introns 7-1 1 , we utilized PCR 
technology. Four PCR reactions were set up using synthetic priming 
oligonucleotides homologous to desired regions of the HSA gene containing 
useful restriction sites. The PCR reactions were designed so that the upstream 
end of reaction #1 overlapped the Nco I site within exon 7. The downstream 

1 0 end of reaction #1 overlapped the Avr II site within intron 8. This same Avr li 
site was overlapped by the upstream end of PCR reaction #2. The downstream 
end of PCR reaction #2 overlapped the Hind III site within intron 8. This Hind III 
site was also overlapped by the upstream end of PCR reaction #3. The 
downstream end of PCR reaction #3 overlapped the Xhol site in Intron #10 

1 5 which was also overlapped by the upstream end of PCR reaction #4. The 
downstream end of PCR reaction #4 overlapped the Asp I site in exon 12. By 
this PCR strategy the entire region desired would be obtained and adjacent 
PCR products would be joined together using overlapped restriction sites. PCR 
primers and reactions were as follows: 
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BamHI NCO I 

Sense primer 5' gaatt cggatccC CATGGAGATCTGCTTGAATGTGCT 3' 

* * 
9540 9564 



Antisense 
1 0 primer 



AvrIX 

TGTTGAATCCTCCTATCGGATCCcagctg 



* 



11127 11149 



15 



20 



PGR #2 



Sense primer 



Antisense 
primer 



Avrll 

5 1 gtcgacCCTAGGCTTTTCTGTGGAGTTGCT 3 1 
* * 
11144 11167 

Hind 1X1 

3 1 TTCAGGAATCGATGATTCGAActtaag 5 f 
12339 12359 



25 PGR #3 

Sense primer 



30 



35 



40 



45 



Antisense 
primer 



PGR #4 

Sense primer 



Antisense 
primer 



Hind III 

• gaattcAAGCTTTACTGCATGGGGTTTAGT 3 * 
★ * 
12354 12377 

XhoZ 

1 TGTTCTGTATCAAAGAAAGGAGCTCcagctg 5 ' 

* * 
14454 14478 



Xhol 

5 1 gtcgacCTCGAGTAGATTAAAGTCATACA 3 * 
* * 
14473 14495 

Aspl BamHI 
3 ' TTGCGGTCATTCACTGTCTCAGtecTAGGCTTAAC 5 1 



15575 



15596 



Note 

Restriction sites in bold are tound within HSA sequences. Non-HSA sequences are 
shown as small letters. Asterisks and associated numbers mark the corresponding HSA gene bp 

50 position. 
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PCR reactions were performed with high molecular weight human 
placental DNA as substrate [40 cycles of 94° C (1\ 30"), 60° C (2'). 72° C (4')]. 
A sample of each reaction was analyzed on a 1% agarose gel. DNA bands of 
expected size (PCR #1 , 1628 bp; PCR #2, 1228 bp; PCR #3, 2136 bp; PCR #4, 
5 1 142 bp) were seen. The remainder of the reactions were extracted two times 
with chloroform, 2 times with phenol/chloroform , 2 times with chloroform and 
subsequently ethanol precipitated. Products of reactions #1 and #2 were 
digested with Avr II. Products of reactions #3 and #4 were digested with Xhol. 
DNAs were subjected to agarose gel electrophoresis and DNA bands cut from 

10 the gel, electroeluted and elutip purified. The purified products of reactions #1 
and #2 were ligated together and products of reactions #3 and #4 were ligated 
together. Since each PCR product had available 5*-phosphates only at their 
digested ends (Avr II or Xhol) only these ends were available for ligation and 
the other end of each DNA was not available for ligation. Ligation products of 

1 5 reactions #1 and #2 as well as ligation products of reactions #3 and #4 were 
subsequently each digested with BamH I and Hind III. The resultant 2814 bp 
DNA representing the ligation of reactions #1 and #2 at their common Avr II site 
with a digested BamH I site at its upstream terminus and a digested Hind III site 
at its downstream terminus was gel and elutip purified. This DNA was ligated 

20 into the BamH I and Hind III sites of plasmid vector pQEM-1 . The 3243 bp DNA 
representing the ligation of reactions #3 and #4 at their common Xhol sites, 
with its upstream terminus digested at its Hind III site and downstream terminus 
digested at its BamH I site was similarly purified and ligated into the Hind III 
and BamH I sites of plasmid vector pGEM-2. Ligation products were 

25 introduced into MC1061 bacterial host cells by electroporation (BTX 

Electroporation System 600) according to manufacturers protocols. Positive 
transformants were selected on ampicillin plates. Restriction analysis of DNAs 
from selected transformants identified desired clones. Construct p679 was the 
designation of the HSA PCR #1 and #2 products cloned into pGEM-1 . It 

30 contains HSA sequences extending from the Nco I site in exon 7 (HSA gene 
bp position 9541) to the Hind III site in the downstream end of intron 8 (HSA 
gene bp position 12355). The pGEM-2 construct containing HSA PCR #3 and 
#4 products was designated p676. It contains HSA sequences extending from 
the same Hind III site in the downstream end of intron 8 (HSA gene bp position 

35 12355) to the Asp I site in exon 12 (HSA gene bp position 15592). 
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When taken together with construct p650, containing the HSA gene 
sequences from the Nco I site upstream of HSA exon 1 (HSA gene bp position 
1462) to the Nco I site in exon 7 (HSA gene bp position 9541) and construct 
p651 , containing HSA gene sequences extending from upstream of the Asp I 
5 site in exon 12 (HSA gene bp position 15592) to the Nco I site downstream of 
exon 15 (HSA gene bp position 18915) these constructs, p679 and p676, 
complete the cloning of the entire HSA gene. 



10 



Example 3 

COMPLETE BLG VECTOR WITH THE HSA cDNA 
UNDER THE CONTROL OF THE BLG PROMOTER (p575) 



Vectors based upon use of the BLG promoter and gene system 
1 5 incorporate the foreign gene to be expressed (HSA) into the Pvu II site of the 
BLG gene within exon 1 just upstream of the BLG translational initiation ATG 
codon. There are a number of Pvu II sites within the BLG sequences. The BLG 
5'-region cloned into pGEM-1 (i.e. plasmid p570) possesses three Pvu II sites 
including the appropriate site within BLG exon 1 . The other two Pvu II sites 
20 were located approximately 2100 and 2600 bp, respectively, upstream of the 
Pvu II site within exon 1 . In order to introduce the HSA cDNA into the 
appropriate Pvu II site of a complete BLG vector, the cDNA was first cloned into 
the correct site within the BLG 5'-clone (p570) and subsequently the BLG 5'- 
clone with HSA cDNA was joined to the BLG 3'-clone. Plasmid, p570, was 
25 partially digested with Pvu II. Linearized plasmid (approximately 7200 bp) was 
recovered by gel and elutip purification and ethanol precipitation. 

The HSA cDNA was isolated from a lambda gt1 1 -human liver cDNA 
library. The complete cDNA clone is 1,983 bp in length and contains the 

30 complete HSA coding sequences, including the 18 amino acid propeptide, the 
6 amino acid propeptide, and the 585 amino acids of the mature protein. The 
cDNA clone also contains 20 bp of 5-untranslated and 141 bp of 3- 
untranslated sequence. The cDNA clone was subcloned into the plasmid 
vector pBS(-) (Stratagene) between the vector BamH I and EcoR I polylinker 

35 restriction sites with the BamH I site at its 5'-end and EcoR I site at its 3'-end. 
This plasmid was referred to as pHSA-F1-. The HSA cDNA sequence was 
released from the bacterial vector by restriction with BamH I and EcoR I. These 
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staggered ends were blunted with Klenow polymerase in the presence of 
excess dNTPs. The blunt ended HSA cDNA was ligated into the purified Pvu II 
linearized plasmid p570 discussed earlier. Ligation products were introduced 
into DH5 cells by transformation to ampicillin resistance. Desired 
5 recombinants were diagnosed by the presence of two Hind III restriction 
fragments (approximately 7807 and 1293 bp) and three BamH I restriction 
fragments (approximately 4280. 3170, and 1700 bp). This confirmed that the 
HSA cDNA was introduced into desired Pvu II site of BLG, in exon 1 , in the 
same orientation as the BLG coding sequence and that the junction between 
1 0 the BLG Pvu II site and the Klenow blunted 5-HSA BamH I site regenerated a 
functional BamH I site. Additional analyses verified that the junction between 
the 3-HSA EcoR I site blunted with Klenow polymerase and the BLG Pvu II had 
regenerated a function EcoR I site. This correct plasmid clone was designated 
p572. 

15 

The 5'-region of the BLG gene with the HSA cDNA cloned into its Pvu II 
site in exon 1 was recombined with the 3'-region of the BLG gene as follows. 
Construct p572 was digested to completion with Pvu I (within pGEM) and 
partially with EcoR I. The desired fragment of approximately 7800 bp 

20 containing pGEM sequences and the 5'-region of the BLG gene to the junction 
of its 3'-end (EcoR I) with the pGEM-1 was gel and elutip purified and ethanol 
precipitated. This was ligated to the 3'-region of the BLG gene, released from 
plasmid p568, as a Pvu l/EcoR I fragment of approximately 5800 bp. This latter 
fragment contains pGEM-1 sequences up to its Pvu I site, and the 3-region of 

25 the BLG gene up to its 5'-junction (EcoR I) with pGEM. Ligation products were 
used to transform DH5 cells to ampicillin resistance. 

Correct, desired recombinants were characterized by Hind III fragments 
of approximately 7840, 3500 and 2200 bp and BamH I fragments of 

30 approximately 4760, 4244, 2000, 1700 and 870 bp. Correct recombinants 
were designated p575. These represent the complete BLG sequences utilized 
in the vectors including 5' and 3' regions, with the HSA cDNA introduced into 
the BLG Pvu II site just upstream of the BLG ATG translation^ initiation codon, 
in the same orientation as the BLG coding sequences, within a pGEM plasmid. 

35 BLG/HSA and HSA/BLG junctions were confirmed by sequence analysis. The 
BLG/HSA sequences are flanked by Sal I sites. 
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As Sal I sites are not found anywhere else in the BLG/HSA sequences, 
digestion of p575 with Sal I releases a BLG/HSA fragment, from the pGEM, 
suitable for injection into mammalian oocytes for the production of BLG/HSA 
transgenics. 

5 

FxamDlQ 4 



10 



20 



INTRODUCTION OF AN HSA MINIGENE CONTAINING 

THE FIRST HSA INTRON INTO A COMPLETE BLG VECTOR (p599) 



In order to elucidate the HSA intron pattern which would result in high 
level expression of HSA in the milk of transgenics, HSA minigenes were 
constructed composed of the HSA cDNA with different combinations of its 
introns. One such minigene involves the incorporation of the first HSA intron 
1 5 into an HSA cDNA. In order to make such a construct, a Cla I restriction site 
was introduced into the region of the HSA cDNA which is derived from HSA 
exon 2, without changing its coding sequence. This was accomplished by in 
vitro m utagenesis (Amersham kit) using single stranded DNA template derived 
from pHSA-F1- and a synthetic oligonucleotide of the sequence, 



- rcftTTSCTC! ATCGATT TAAAGATTTGGG-3 ' 
Cla I 



25 This mutagenesis introduced the Cla I site by replacing the HSA cDNA 

G with an A in the third base position of the codon for arginine, the 34th amino 
acid of the HSA protein (including the prepro sequence). Both CGA and CGG 
codons encode arginine. The resultant clone, made up of an HSA cDNA 
containing an introduced Cla I site within its exon 2 derived region in the pBS 

30 vector, was designated p595. 

A DNA fragment composed of intron 1 of the HSA gene, along with parts 
of flanking exon 1 and exon 2, was produced by PCR technology using 
synthetic oligonucleotide primers which included sequences complementing to 
35 exon 1 and exon 2 as seen below. 
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Exon 1 Ssnsp Primer 

HinH TTT He^ Ba'tETT 

5 5 1 H5TACATAAG CTTTGGCACAATGAAGTGGGTAACCTT -3 ' 

Exon 1 sequence 



10 



15 



Exon 2 Antisense Primer 
Cla I 



3 ' -CCAACGAGTAGCTAAATTTCTAAArrr-S • 



* 



Exon 2 sequence 

•Introduced Cla I site by incorporating a T in this position rather than a C. No 
change in coding as discussed above. 



The template for this PCR reaction was a clone of the HSA gene 
20 extending from the gene exon 1 to exon 3, including introns 1 and 2. This 

clone designated p594 will be discussed later. Following ethanol precipitation, 
the PCR product of 844 bp was digested with Bst Ell and Cla I. The - 
subsequent DNA fragment of 799 bp was gel and elutip purified and ethanol 
precipitated. 

25 

Plasmid p595 was digested with BstEII and Cla I. The large fragment 
lacking the BLG cDNA region between the BstEII and Cla I sites was gel 
purified, electroeluted, elutip isolated and ethanol precipitated. 

30 The purified PCR product with BstEII and Cla I ends and the purified 

p595 fragment with BstEII and Cla I ends were ligated together. Ligation 
products were transformed into DH5 ceils and positive transformants selected 
on LB-ampicillin plates. 



35 Transformants containing plasmid containing the PCR insert were 

identified by colony lifts of plates using Whatman 541 filters as described 
earlier. The probe was 32 P-labeled random primer product of the PCR product 
used in the ligation. Following autoradiography of filters, probe positive 
colonies were picked, grown in LB-ampicillin medium. Plasmid preparations 

40 from these were analyzed separately with Cla I and Xba I. Correct plasmids 
which possessed a Cla I site and produced Xba I fragments of 4010 and 1874 
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bp were identified. A correct recombinant made up of an HSA minigene 
comprised of its cDNA and HSA intron 1 within a pBS vector was designated 
p596. 

5 The HSA (intron 1 ) minigene was introduced into BLG vector p585 as 

follows: The HSA (intron 1) minigene was released from the pBS vector by 
complete digestion with BamH I and partial digestion with EcoR I. The 2701 bp 
minigene was gel and elutip purified and ethanol precipitated. The BamH I 
and EcoR I ends of the minigene were blunted using Klenow polymerase and 

1 0 excess dNTPs. This blunted fragment was ligated into BLG vector p585 which 
had been digested with SnaB I and dephosphorylated with CIP. Ligation 
products were transformed into DH5 cells to ampicillin resistance. Positive 
colonies were identified by colony lifts with Whatman 541 filters and 
hybridization to the HSA intron 1 PCR probe describe above. Probe positive 

1 5 colonies identified by autoradiography were grown in LB-ampicillin medium 
from which plasmid preparations were made. Clones which possessed the 
HSA (intron 1) minigene in the desired orientation, that is with the HSA 
minigene in the same orientation as the BLG gene, were identified by their 
release of restriction fragments of approximately 10676 and 3600 bp upon 

20 EcoR I digestion and approximately 4490, 3600, 3400 and 2860 bp upon EcoR 
I/Sal I double digestion. This desired clone, the HSA (intron 1) minigene in the 
correct orientation within the SnaB I site of BLG vector p585 was designated 
p599. 

25 Example 5 

A. CONSTRUCTION OF A BLG VECTOR LACKING CODING SEQUENCE 
(p590) 

30 The full length BLG vector was altered so that while it still contains the 

5-BLG sequences, including promoter, upstream of the BLG coding sequence 
and 3 -BLG sequences, including the BLG polyadenylation signal and site, 
downstream of the coding sequence, all BLG coding sequence was deleted. 
Plasmid p583. containing the 5-region of BLG with a SnaB I site replacing the 

35 Pvu II site in BLG exon 1 , was digested with SnaB I and Pvu I (within pGEM). 
The DNA fragment (approximately 4490 bp) containing pGEM sequences and 
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the 5'-portion of the BLG gene up to the SnaB I site, was gel and elutip purified 
and ethanol precipitated. 

Restriction mapping of plasmid p568, BLG 3'-portion in pGEM-1 , 
demonstrated that the Xma I (isoschizomer of Smal) site within exon 7 of the 
BLG gene was the 3'-most Xma I site within the 3'-portion of the gene. Plasmid 
p568 was digested with Xma I and Pvu I (within pGEM). The DNA fragment of 
approximately 2660 bp containing pGEM sequences and the 3'-end of the 3'- 
portion of the BLG gene (up to the Xma I site within exon 7) was gel and elutip 
purified and ethanol precipitated. 

Two synthetic oligonucleotides were produced that when annealed 
formed a SnaB l/Xma I adaptor of the sequence. 

JL 

-SnaB T Xmal 

5' - GTAGATCTC 3' 
3' - CAI£IAj3A.GGGCC 5' 
Bgl II 

A ligation between the purified fragments from p583, p568 and the 
annealed oligonucleotide adaptor was performed. Ligation products were 
transformed into DH5 cells to ampicillin resistance. Correct recombinants were 
identified by the fact that Bgl II digestion linearized the plasmid, that BamH I 
digestion resulted in 3 fragments (approximately 4200, 2050 and 900 bp) and 
25 that SnaB I and Hind III digestion produced fragments of approximately 5950 
and 1200 bp. These correct recombinants containing the 5'- and 3'-ends of the 
BLG gene connected by a SnaB I site but without BLG coding sequence were 
designated p590. 

30 B. CONSTRUCTION OF BLG VECTOR (BLG 5-AND 3'-SEQUENCE, NO 
BLG CODING SEQUENCE) WITH AN HSA MINIGENE CONTAINING 
HSA INTRON 1 (p600) 

Construct p590 was digested with SnaB I and resultant restricted ends 
35 dephosphorylated with CIP. The HSA minigene with intron 1 was released 
from p596 by complete digestion with BamH I and partial digestion with EcoR I. 
The 2701 bp fragment composed of the entire minigene was gel and elutip 
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purified and ethanol precipitated. The BamH I and EcoR I ends of this fragment 
were made blunt using Klenow polymerase and excess dNTPs. 

The prepared minigene was ligated into the prepared p590 vector. 
5 Ligation products were introduced into iLfiPi DH5 ce,ls bv transformation to 
ampicillin resistance. Plates containing selected colonies were lifted onto 
Whatman 541 filters and processed as before. A 3 2 P-labeled probe, the PCR 
produced HSA intron 1 discussed in the section describing the construction of 
plasmid 599, was made by the random primer technique. Hybridization and 

1 0 subsequent filter washes were as described for the colony lifts for the 

identification of p599 positives. Following autoradiography, probe positive 
colonies were picked and grown in LB-ampicillin medium. Plasmid 
preparations from these were subjected to restriction analysis. Correct 
recombinants, with the HSA minigene cloned into the SnaB I site in the same 

1 5 orientation as the BLG promoter, were identified by the production of a single 
EcoR I fragment (approximately 9800 bp) and EcoR I/Sal I double digest 
fragments of approximately 351 1 , 3419 and 2850 bp and were designated 
p600. 

20 Example 6 

A. CONSTRUCTION OF BLG VECTOR WITH AN HSA MINIGENE 
CONTAINING HSA INTRONS 1 AND 2 (p607) 

25 In preparing this construct the HSA cDNA was subcloned into pGEM-1 . 

pGEM-1 was digested with Pvu II Oust outside of the polylinker) and EcoR I 
(within the polylinker). The 2819 bp vector fragment was gel and elutip purified 
and ethanol precipitated. The HSA cDNA subcloned in the plasmid vector 
pBS* (pHSA-F1-) as described in the section on the construction of p575, was 

30 released from pHSA-F1- by digestion with Sal I which was blunted with Klenow 
polymerase and excess dNTPs and subsequently digested with EcoR I. The 
2004 bp fragment of the HSA cDNA was gel and elutip purified and ethanol 
precipitated. This cDNA fragment was ligated into the purified pGEM-1 plasmid 
fragment digested with Pvu II (blunt) and EcoR I. Ligation products were 

35 transformed into DH5 cells and positive transformants selected on LB- 
ampicillin plates. Selected colonies were lifted onto Whatman 541 filters as 
previously described. A 32 P-labeled HSA cDNA probe was produced by the 
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random primer technique using a 1941 bp Hind III fragment of pHSA-F1" as a 
template. Hybridization was in standard buffer at 65°C. Washing conditions 
included a final, most stringent wash of 0.5 x SSC, 0.2% SDS at 65°C. Probe 
positive colonies were picked, grown in LB-ampicillin medium. Plasmids 
5 obtained from these cultures were subjected to restriction analysis. Correct 
recombinants were characterized by the presence of 281 8 and 201 1 bp BamH 
I fragments and 2806, 1 165 and 858 bp Xba I fragments. Correct clones were 
designated p597. 

10 A DNA fragment encompassing the HSA genomic sequence including 

part of exon 1 , intron 1 , exon 2, intron 2 and part of exon 3 was produced by 
PCR technology using the following synthetic oligonucleotide primers. 



15 



Exon 1 Sense PHmpr 

mrirtTTT Mfil. BfltRTT 

5 ' -GTACATAAGCTTTGGrAPAATflAASTG GGTAArrTT -3 • 

Exon 1 sequence 



20 



Exon 3 Antlsense Primer 



3« PTU IT BamH T 

25 Exon 3 sequence 



High molecular weight DNA purified from human lymphocytes was used 
as a template. This PCR product was digested with Hind III and Pvu II and the 
resultant 2422 bp fragment gel and elutip purified and ethanol precipitated. 

30 This fragment was ligated into a pGEM-1 vector prepared by digestion with 
Hind III and Pvu II, gel and elutip purification and ethanol precipitation of the 
resultant 2774 bp fragment. The pGem fragment was subsequently 
dephosphorylation with CIP. Ligation products were used to transform DH5 
cells to ampiciliin resistance. Correct recombinants clones were identified by 

35 the presence of a BstEII site resulting in linearization of plasmid (5196 bp) and 
the generation of 2774 and 2422 bp fragments upon digestion with Hind III and 
Pvu II. Correct recombinants were designated p594. 

HSA introns 1 and 2 were introduced into the HSA cDNA as follows. 
40 HSA sequences from exon 1 to exon 3 including introns 1 and 2 were released 
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from p594 by digestion with BstEll (within exon 1) and Pvu II (within exon 3) as 
a 2401 bp fragment. This fragment was gel and elutip purified and ethanol 
precipitated. Plasmid p597 was similarly digested with BstEll and Pvu II. The 
4596 fragment (which lacks the cDNA sequences between these sites in the 
5 cDNA), was gel and elutip purified and ethanol precipitated. These two 
prepared fragments were ligated and ligation products transformed into DH5 
cells to ampicillin resistance. Transformants containing the HSA introns were 
identified by Whatman 541 filter lifts and hybridization with a 3 2 P-5*-end 
labeled (using T4 polynucleotide kinase) synthetic oligonucleotide probe. The 
1 0 synthetic oligonucleotide used has HSA intron 2 specific sequence, i.e., 
5' GTCACATGTGGCTAATGGCTACTG 3'. 

Hybridization was in standard buffer at 60°C. Filters were washed with 
the final wash of 2 x SSC, 0.5% SDS, 60°C. Positive colonies were subjected 
15 to restriction analysis. Correct recombinants were identified by 2 BamH I 
(approximately 4174 and 2818 bp) fragments and 2 EcoR I (approximately 
3765 and 3227 bp) fragments. These correct clones contain an HSA minigene 
possessing HSA introns 1 and 2 and were designated p603. 

20 The HSA minigene with introns 1 and 2 was introduced into BLG vector 

p590 as follows. Vector p590 was digested with SnaB I and dephosphorylated 
with CIP. The HSA minigene was released from p603 as a 4174 bp BamH I 
fragment which was gel and elutip purified and ethanol precipitated. The 
BamH I ends of this fragment were blunted using Klenow polymerase and 

25 excess dNTPs. The prepared minigene from p603 and the prepared vector, 
p590, were ligated together. Ligation products were transformed into DH5 
cells to ampicillin resistance. Correct clones were identified by the presence of 
2 EcoR I fragments ( approximately 7485 and 3765 bp) and 2 Xba I fragments 
(approximately 9196 and 2054 bp). The correct clones have the HSA 

30 minigene inserted into the SnaB I site of the BLG vector p590 in the same 
orientation as the BLG promoter and were designated p607. 

B. CONSTRUCTION OF A BLG VECTOR WITH A DOWNSTREAM SV40 
POLYADENYLATION POLY A SIGNAL (p589) 

35 

In order to examine whether the poiyadenylation signal source affected 
expression of the HSA protein, a BLG vector having the BLG coding sequence 
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replaced by an HSA coding sequence was constructed having an SV40 
polyadenylation signal instead of the BLG poly A signal. The BLG sequences 
contained within the construct made here are only those upstream of the BLG 
coding sequence which include the promoter region. The BLG coding 
5 sequences as well as downstream sequences were deleted. The construct 
maintains the SnaB I site in BLG exon I for the insertion of foreign genes, in this 
case HSA. An SV40 polyadenylation signal was placed downstream of the 
SnaB I site in order to supply this, 3-RNA processing signal. 

1 0 Plasmid p583 containing the 5'-EcoR I subgenomic portion of the BLG 

gene with an SnaB I site introduced in exon 1 was digested with SnaB I and 
partially digested with Sal I. The DNA fragment of approximately 5800 bp 
made up of almost the entire pGEM-1 plasmid digested at its native Sal I site 
within its polylinker and the 5'-end of the 5'-portion of the BLG gene to the 

15 introduced SnaB I site was gel and elutip purified and ethanol precipitated. 

The SV40 early gene (T and t) polyadenylation signal poly A was 
released from SV40 DNA by restriction with Bell at its 5'-end (SV40 map 
position 2770) and BamH I at its 3-end (SV40 map position 2533). The 237 bp 

20 poly A signal and site fragment was gel and elutip purified and ethanol 

precipitated. This fragment was ligated into the BamH I site (dephosphorylated 
with CIP) of pGEM2. Ligation products were transformed into HB101 cells to 
ampicillin resistance. As the SV40 poly A signal fragment was able to be 
ligated into the BamH I site of pGEM in either orientation, the desired 

25 orientation was determined by analysis of plasmid DNAs from selected clones 
by digestion with Dral. Clones with the desired orientation, (the SV40 poly A 
signal fragment 5-end (Bell end) downstream of the polylinker Xba I and Sal I 
sites) were characterized by Dral fragments of 1203, 1192, 692 and 19 bp. 
Desired recombinants were designated p290. Additional restriction sites 

30 including SnaB I were introduced upstream of the polyA signal as follows. 
Plasmid p290 was digested with Sac I and Aval (within the pGEM2 polylinker), 
the large fragment gel and elutip purified and ethanol precipitated. Into the 
resultant fragment was ligated synthetic oligonucleotides which when 
annealed were of the sequence; 

35 

Sac I SnaB T Hind in 

5' CCTCGAGTACGTAAGATCTAAGCTTC 3' 

3 ' TCGAGGAGCTCATGCA TTCTAGAT TCGA AGGftrr. 5 ' 
Xho I Bgl II Ava I 
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Ligation products were transformed into E^coli HB101 cells to ampicillin 
resistance. Resultant correct recombinants which now possessed these 
additional multiple cloning sites upstream of the poly A signal sequence were 
5 designated p299. 

The SV40 poly A signal and site sequences were released from p299 by 
digestion at the upstream SnaB I and downstream Sal I sites. This 270 bp 
fragment was gel and elutip purified and ethanol precipitated. It was ligated 
10 into the SnaB l/partial Sal I purified fragment from p583. Ligation products 
were transformed into DH5 cells to ampicillin resistance. Correct recombinants 
were identified by linearization by SnaB I (approximately 6200 bp) and the 
generation of 2 fragments (approximately 3400 and 2800 bp) upon digestion 
with Sal I. Correct recombinants were designated p589. 

15 

C. CONSTRUCTION OF BLG VECTOR (BLG S'-SEQUENCES, SV40 3'- 
POLY A SIGNAL, NO BLG CODING SEQUENCE) WITH AN HSA 
MINIGENE CONTAINING HSA INTRON 1 (p598) 

20 Construct p589 was digested with SnaB I and resultant restricted ends 

dephosphorylated with CIP. This DNA was ligated with the prepared HSA 
minigene with intron 1 blunt ended as described in the section on the 
concentration of p599 and ligation products introduced into DH5 cells by 
transformation. Correct recombinants with the HSA minigene cloned into the 

25 SnaB I site in the same orientation as the BLG promoter were identified by the 
production of a single EcoR I linear fragment (approximately 8800 bp) and 
EcoR I/Sal I fragments of approximately 3450, 2850 and 2500 bp. These 
correct clones were designated p598. 

30 D. INTRODUCTION OF AN HSA MINIGENE CONTAINING THE FIRST HSA 
INTRON INTO A BLG VECTOR WITH EXTENDED 5 -SEQUENCES 
(p643, p647) 

In order to ascertain whether longer 5' BLG would increase the 
35 expression of HSA, an HSA minigene, with intron 1 , was introduced into BLG 
constructs with extended 5-BLG sequences (5.5 Kb or 10.8 Kb). BLG coding 
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and 3'-sequences are not included into these vectors. The SV40 sequences 
containing the polyadenylation signal and site are included. 

In order to produce the clone with the BLG 5.5 Kb 5'-sequence construct 
5 p640 was digested with Asp 71 8, (between the original BLG 5-EcoR I site and 
the transcriptional initiation site), and Pvu I (within pGEM). The fragment 
(approximately 6143 bp) which includes pGEM sequences and the BLG 
sequences upstream of the Asp 718 site was gel and elutip purified and 
ethanol precipitated. Vector p598 was similarly digested with Asp 718 and Pvu 

10 I. The fragment (approximately 5184 bp) which includes pGEM sequences, 
complementary to those above, as well as BLG 5-sequences downstream of 
the Asp 718 site, the HSA minigene with its first intron and the SV40 poly A 
site, was gel and elutip purified and ethanol precipitated. These two fragments 
were ligated together and ligation products transformed into DH5 cells to 

1 5 ampicillin resistance. Correct recombinants were identified by the generation 
of 3 fragments (approximately 8164, 2831 and 320 bp) upon Hind III digestion 
and were designated p643. 

The BLG 10.8 Kb 5-sequence construct was made in a similar manner 
20 asforp643. Plasmid p645 was digested with Asp 71 8 and Pvu I. The resultant 
fragment (approximately 1 1 ,384 bp) was gel and elutip purified and ethanol 
precipitated. This fragment was ligated to the purified fragment (approximately 
5184 bp) generated by the digestion of p598 with Asp 718 and Pvu I discussed 
above. Ligation products were transformed into DH5 cells to ampicillin 
25 resistance. Correct recombinants were identified by the generation of 5 DNA 
fragments (approximately 8159, 3460. 2829, 1800 and 320 bp ) upon digestion 
with Hind III. Correct recombinants were designated p647 and include 5'-BLG 
sequences of 10.8 Kb in combination with an HSA minigene possessing intron 
1. 

30 

Examnla 7 



CONSTRUCTION OF BLG VECTORS WITH AN HSA MINIGENE 
CONTAINING HSAHNTRONS 1-6 (p652, p661) 
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Transgenic BLG constructs with an HSA minigene containing HSA 
introns 1-6 were made by taking advantage of the unique BstEII (within exon 1) 
and the Nco I (within exon 7) sites within the HSA gene. 

5 Construct p598 f which is equivalent to the desired vector except that its 

HSA minigene contains only intron 1, was digested with BstEII and Nco L The 
DNA fragment (approximately 7304 bp) lacking HSA sequences between 
these two sites was gel and elutip purified and ethanol precipitated. Plasmid 
pS50 was similarly digested with BstEII and Nco I. The resultant 7,756 bp 

10 fragment containing the HSA gene region between these sites including 
introns 1-6 was also gel purified and ethanol precipitated. The two purified 
fragments were ligated together and transformed into TG1 cells to ampicillin 
resistance. Correct recombinants were identified by 4 DNA fragments 
(approximately 9819, 4682, 320, and 240 bp) upon digestion with Hind III and 

15 5 fragments (approximately 9552, 1882, 1296, 1258, and 1073 bp) upon 
digestion with Xba I. These were designated p652 (ATCC No. 68653). 
Construct p652 possesses a SV40 poly A site. 

A second BLG vector similar to p652, except that 3 -sequences 
20 containing the polyadenylation signal and site were contributed by BLG 3'- 
sequences, was constructed in a parallel manner. Construct p600 was 
digested with BstEII and Nco I. The DNA fragment of approximately 8266 bp, 
lacking HSA sequences between these sites was gel and elutip purified and 
ethanol precipitated. It was ligated with the purified 7756 bp p650 BstEII - Nco I 
25 fragment discussed above, composed of HSA gene sequences between these 
two sites. Ligation products were transformed into DH5 cells to ampicillin 
resistance. Correct recombinants were identified by the possession of unique 
BstEII and Nco I sites, resulting in the generation of a single DNA fragment 
(approximately 16021 bp) upon digestion with each restriction enzyme. In 
30 addition, correct clones were identified by the generation of 3 DNA fragments 
(10945, 4171 , and 905 bp) upon digestion with BamH I. Correct clones were 
designated p661. 



35 



Example 8 

CONSTRUCTION OF BLG VECTOR WITH AN HSA MINIGENE 
CONTAINING HSA INTRONS 7-14 (p687) 
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This transgenic vector made up of an HSA minigene with introns 7-14 
under the control of the BLG promoter and with an SV40 poly A site was 
constructed in several steps. The downstream HSA gene region containing 
5 introns 12-14 (within. p651) was combined with HSA cDNA sequences up to 
intron 12, thereby creating an HSA minigene with introns 12-14. An SV40 poly 
A site was then joined to the 3'-end of this minigene. This was then 
manipulated to include introns 7-1 1 to produce an HSA minigene with introns 
7-14, with an SV40 poly A site. Finally, the minigene was recombined with the 
1 0 5'-BLG promoter sequences resulting in construct p687. 

In the first step, construct p651 was digested with Asp I (within HSA exon 
12) and Hind III (within HSA intron 15). The resultant, desired, fragment (2972 
bp) extending from exon 12 to exon 1 5 was gel and elutip purified. 

1 5 Translational termination of the HSA RNA occurs within exon 14 derived 
sequences and polyadenylation of the HSA transcript occurs downstream of 
the Hind III site. Therefore, the isolated fragment has been separated from the 
HSA poly A signal and site but has not had any coding sequences deleted. 
Construct p597 (HSA cDNA within pGEM-1) was similarly digested with Asp I 

20 and Hind III. The resultant fragment (approximately 4263 bp) with the HSA 
cDNA sequences between the Asp I (within exon 12 derived sequences) and 
Hind III (within exon 15 derived sequences) was gel and elutip purified. The 
two fragments were ligated together and ligation products introduced into 
£0li MC1061 cells by electroporation to ampicillin resistance. The correct 

25 recombinants were characterized by the presence of single Asp I and Hind III 
sites as well as 2 fragments (approximately 4275 and 2800 bp) upon digestion 
with EcoR I and BamH I and were designated p668. Construct p668 contains 
an HSA minigene with introns 12-14 within pGEM-1. 

30 SV40 poly A signal and site sequences were introduced downstream of 

the HSA minigene in p668. Construct p668 was digested with Hind ill (at the 
downstream end of HSA sequences) and the blunt cutter Nae I (within pGEM 
sequences approximately 300 bp from the Hind III site). The fragment 
(approximately 6800 bp), deleted of the 300 bp between the two sites, was gel 

35 and elutip purified. SV40 poly A site sequences were released from p299 by 
first digesting with Pstl (within the multiple cloning site adjacent to the 
downstream end of the poly A site sequences). The Sal I site between the poly 
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A site sequences and the Pstl site remains adjacent to the downstream end of 
the poly A site sequences. The digested Pstl site was blunted by filling in with 
T4 DNA polymerase in the presence of excess dNTPs. The DNA was then 
digested with Hind III (within the multiple cloning site adjacent to the upstream 
5 end of the poly A site sequences) and the released SV40 sequences fragment 
(approximately 200 bp) was gel and elutip purified. The two purified DNAs 
were ligated together and transformed into fLcM DH5 cells to ampicillin 
resistance* Correct clones with the SV40 poly A site ligated at the downstream 
end of the HSA minigene in the same orientation were characterized by the 
1 0 generation of 2 fragments (approximately 4700 and 2300 bp) upon Sal I 

digestion and 3 fragments (approximately 4200, 2300 and 500 bp) upon Sal I 
and EcoR I digestion. These correct constructs were designated p674. 

The introduction of HSA introns 7-1 1 into p674 containing the HSA 

15 minigene with introns 12-14 was accomplished by a tripartite ligation. The 
HSA gene sequences contained within p679 were released by digestion with 
Nco I (within exon 7) and Hind III (within the downstream end of intron 8) as a 
fragment of 2814 bp which was gel and elutip purified. The HSA sequences 
contained within p676 were released by digestion with Hind III (the same site 

20 within the downstream end of intron 8 described for p679) and partial digestion 
with Asp I (within exon 12; a second Asp I site is found within intron 1 1) as a 
fragment of 3237 bp which was gel and elutip purified. Construct p674 
(containing the HSA minigene with introns 12-14) was digested with Nco I 
(again within exon 7) and Asp I (again within exon 12). The resultant DNA 

25 (approximately 6400 bp) deleted of HSA cDNA sequences between the Nco I 
and Asp I sites was gel and elutip purified. The 3 purified DNAs were ligated 
together and ligation products introduced into JEiCQ]LMC1061 cells by 
electroporation to ampicillin resistance. Correct recombinants were 
characterized by the generation of 2 Asp I fragments (approximately 12406 and 

30 273 bp) and 5 Xba I fragments (approximately 3537, 2719, 2623, 2566 and 
1234 bp). These correct constructs, designated p683, are HSA minigenes with 
introns 7-14, with an adjacent downstream SV40 poly A site within pGEM-1. 

Finally, the HSA minigene with introns 7-14 was introduced into a 
35 transgenic construct downstream of the BLG S'-flanking promoter sequences. 
Construct p683 was digested with Nco I (within HSA exon 7) and Pvu I (within 
pGEM). The DNA fragment (approximately 10400 bp) made up of the HSA 
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gene sequences downstream of the Nco I site (including introns 7-14), the 
SV40 poly A site and the adjacent pGEM sequences was gel and elutip 
purified. Construct p572 (containing the HSA cDNA within the Pvu II site of BLG 
exon 1) was similarly digested with Nco I (within exon 7 derived sequences) 
5 and Pvu I (within pGEM). The DNA fragment (approximately 5570 bp) made up 
of pGEM sequences complementary to the pGEM sequences In the fragment 
described above, BLG 5*-flanking promoter sequences and the HSA cDNA up 
to the Nco I site within exon 7, was gel and elutip purified. The two purified 
fragments were ligated together and ligation products were introduced into 
1 0 f^&li DH1 OB cell by electroporation to ampicillin resistance. 

Correct recombinants were characterized by 3 fragments (approximately 
10098, 4183 and 1699 bp) generated by digestion with BamH I. These correct 
constructs, designated p687, contain the HSA minigene with introns 7-14, a 
1 5 downstream SV40 poly A site and an upstream BLG promoter and represent a 
final BLG/HSA construct for introduction into transgenic animals. 

Example 9 

20 CONSTRUCTION OF BLG VECTOR WITH AN HSA MINIGENE 
CONTAINING INTRONS 2 AND 7-14 (p696) 

The construction of this transgenic vector containing an HSA minigene 
wjlti introns 2 and 7-14 was performed in several steps. The Cla I site which 

25 had previously been introduced into HSA exon 2 derived sequences (without 
altering the encoded amino acid) irran HSA cDNA In plasmid pBS* (p595) was 
transferred into an HSA cDNA within a pGEM plasmid. A PCR product 
extending from HSA exon 2 through intron 2 and into exon 3 was introduced 
into the homologous region in the HSA cDNA within pGEM using this Cla I site. 

30 Finally, intron 2 was transferred into the transgenic construct carrying an HSA 
minigene with introns 7-1 4 (p687). 

The Cla I site which had been introduced into HSA exon 2 was 
transferred from the HSA cDNA in construct p595 (pBS" plasmid) to an HSA 
35 cDNA in a pGEM plasmid (p597) because there are multiple Pvu II sites within 
p595 which interfere with the subsequent recombination strategy (p597 has a 
unique Pvu II site). Construct p597 was digested with BstEII (within HSA exon 
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1 derived sequences) and Nco I (within HSA exon 7 derived sequences). The 
large DNA fragment deleted of the HSA cDNA sequences between the BstEII 
and Nco I sites was gel and elutip purified. Construct p595 Was similarly 
digested with BstEII and Nco I and the DNA fragment (801 bp) of the HSA 
5 cDNA sequences between these sites, including the previously introduced Cla 
I site, was gel and elutip purified. These two purified DNAs were ligated 
together and ligation products introduced into E.coli DH10B cells by 
electroporation to ampidllin resistance. Correct recombinants were 
characterized by the presence of a Cla I site and were designated p689. 

10 

HSA intron 2, with flanking exon regions including the introduced Cla I 
site in exon 2. was obtained by PCR technology. The HSA exon 2 sense 
synthetic oligonucleotide primer (with incorporated Cla I site) and the HSA 
exon 3 antisense primer (with incorporated Pvu II site), shown below, were 
1 5 used for a PCR with construct p594 (containing HSA intron 2 and flanking 
exons) as template. 

HSA Exon 2 5' GGTTGCT CATCGATT TAAAGATTTGGG 3' 

Sense Primer Cla I 

20 

HSA Exon 3 3 * GACTACTC AGTCGACT TTTAAcftCTTAAGGGACTG 5 ' 

Antisense Primer Pvu II (Noij-hsa sequences) 

25 The reaction was of 28 cycles of 94° C (1 '30"), 55° C (2') and 72° C (3'). 
Following two chloroform extractions and ethanol precipitation the PCR 
products were digested with Cla t and Pvu II. The resultant 1 602 bp DNA was 
gel and elutip purified. Construct p689 was digested with Cla I (within HSA 
exon 2) and Pvu II (within HSA exon 3) and the large DNA fragment deleted of 

30 HSA cDNA sequences between the Cla I and Pvu II sites was gel and elutip 
purified. The two purified DNAs were ligated together and ligation products 
transformed into E.coli DH5 cells to ampicillin resistance. The generation of 
linear DNA of approximately 6500 bp by Cla I and Pvu II, individually, identified 
correct recombinants. These were designated p690 and are composed of an 

35 HSA minigene with intron 2 within a pGEM plasmid. 
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Finally, HSA Intron 2 was transferred into transgenic vector p687 
containing an HSA minigene with introns 7-14. Construct p687 was digested 
with BstEII (within HSA exon 1) and Nco I (within HSA exon 7). The large DNA 
fragment deleted of HSA cDNA sequences between the BstEII and Nco I sites 
5 was gel and elutip purified. Construct p690 was similarly digested with BstEII 
and Nco I. The released 2255 bp fragment made up of HSA cDNA from the 
BstEII site to the Nco I site and includes intron 2, was gel and elutip purified. 
The two purified DNAs were ligated together and ligation products transformed 
into EcpJi DH5 cells. Correct recombinants were characterized by the release 
10 of a 2255 bp fragment upon digestion wjfli BstEII and Nco I. These constructs, 
designated p696 (ATCC No. 68654), are made up of an HSA minigene with 
introns 2 and 7-14 under the control of the BLG promoter and flanked 
downstream by an SV40 poly A site and are suitable for the production of 
transgenic animals. 



15 



ExamnlB 10 



CONSTRUCTION OF BLG VECTOR WITH A COMPLETE 
HSA GENE INCLUDING ALL INTRONS, 1-14 (p686) 

20 

The construction of the transgenic vector utilizing the entire HSA gene 
including all 14 of its Introns was by recombining construct p652. containing an 
HSA minigene with introns 1-6, and construct p683, containing an HSA 
minigene with introns 7-14. Construct p652 was digested with Nco I (within 

25 HSA exon 7) and Pvu I (within pGEM). The DNA fragment (approximately 

12530 bp) made up of pGEM sequences, the BLG 5*-flanking promoter and the 
HSA minigene (including introns 1-6) up to the Nco I site in exon 7 was gel and 
elutip purified. Construct p683 was similarly digested with Nco I (within HSA 
exon 7) and Pvu I (within pGEM). The DNA fragment (approximately 10400 bp) 

30 made up of the HSA minigene downstream of the Nco I site in exon 7 

(including introns 7-14), the adjacent downstream SV40 poly A site and its 
adjacent pGEM sequences complementary to the pGEM sequences in the 
fragment described above, was gel and elutip purified. The two purified 
fragments were ligated together and ligation products were introduced into 

35 Ecpl DH1 OB cells by electroporation to ampicillin resistance. Correct 
recombinants were characterized by the generation of two fragments 
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(approximately 18728 and 4186 bp) upon digestion with BamH I and were 
designated p686. 

All exons were sequenced in this vector and found to be correct thereby 
5 confirming that clonings of the HSA gene had been correct. 

Fxampls 11 

A. CONSTRUCTION OF A VARIANT HSA cDNA WHICH ENCODES A PHE 
1 0 AMINO ACID AT HSA PROTEIN POSITION #403 INSTEAD OF A LEU AMINO 

ACID (p822) 

The HSA cDNA sequence used in the constructs described above 
encoded an HSA protein sequence which differed from the HSA sequence 

1 5 described by Minghetti et al. (1986) by one amino acid. The Minghetti 

sequence encodes a Phe amino acid (TTC codon) at position #403 (based 
upon numbering of amino acids with the first amino acid of the mature HSA 
protein designated position #1) while that described in the examples above 
encoded a Leu amino acid (codon CTC). The sequence of Minghetti was 

20 prepared in order to demonstrate that the vectors of the present invention were 
able to result in high level expression of variant forms of HSA. In vitro 
mutagenesis (site directed mutagenesis) was performed using a commercial kit 
(Amersham SDM Kit), with single stranded DNA template derived from pHSA- 
F1~, a synthetic oligonucleotide of the sequence 

25 S-GGAGAGTACAAATTCCAGAATGCGCTA-S' as a primer for first strand 
synthesis and a synthetic oligonucleotide of the sequence 
5 , -AGCGCATTCTGGAATTTGTACTCT-3 , as a primer for second strand 
synthesis. The resultant clone, made up of an HSA cDNA encoding a Phe 
(TTC codon) at HSA amino acid position #403 within the pBS" vector, was 

30 designated p822. The desired change was verified by DNA sequencing. 

B. CONSTRUCTION OF AN IN VITRO ANALYSIS VECTOR FROM p822 
(p823) 

35 An in vitro analysis vector containing the HSA cDNA with the Minghetti 

codon sequence for HSA amino acid #403 was constructed. Construct p822 
was digested with Nco I and Avr II (sites which flank the codon for amino add 
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#403). The resultant 549 bp DNA fragment, which contained the codon for 
amino acid #403, was gel and elutip purified. In vitro analysis vector p658 
(containing the HSA cDNA) was similarly digested with Nco I and Avr II. The 
large DNA fragment lacking the HSA sequences between the two sites was gel 
5 and elutip purified. The two purified fragments were ligated together and 
introduced into E. coli DH5 cells to confer ampicillin resistance. Correct 
recombinants were identified by the generation of a 549 bp fragment upon 
digestion with Nco I and Avr II. Sequencing verified that the Minghetti codon 
for amino acid #403 was present. This new in vitro analysis vector was 
10 designated p823. 

In vitro analysis, as previously described, demonstrated that p823 
supports the expression and secretion of immunoprecipitable HSA from 
transfected mammalian cell line COS-7 cells. 

15 

Example 12 

A. CONSTRUCTION OF A BLG VECTOR WITH AN HSA MINIGENE 
CONTAINING INTRONS 1-6 WITH A PHE CODON FOR HSA AMINO ACID 
20 #403 (p825) 

A transgenic vector containing an HSA minigene with introns 1-6 and 
encoding the Minghetti amino acid sequence at position #403 was constructed. 
Vector p652 was digested with Nco I (within HSA exon 7) and Pvu I (within 

25 pGEM). The large DNA fragment, comprising the BLG promoter and HSA 
sequences upstream of the Nco I site, including HSA introns 1-6, was gel and 
elutip purified. Construct p823 was similarly digested with Nco I and Pvu I. 
The smaller DNA fragment (approximately 2548 bp) comprising HSA cDNA 
sequences downstream of the Nco I site (including Minghetti codon sequence 

30 for amino acid #403) and the SV40 polyadehylation site was gel and elutip 
purified. The two purified fragments were ligated together and introduced into 
E. coli DH5 cells to confer ampicillin resistance. Correct recombinants were 
identified by the generation of 5 DNA fragments (approximately 9308, 1882, 
1296, 1257 and 1091 bp) upon digestion with Xba I and 6 DNA fragments 

35 (approximately 5916, 3338, 2137, 1768, 1039 and 636 bp) upon digestion with 
Asp I. This new transgenic vector was designated p825. 
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B. CONSTRUCTION OF AN IN VITRO ANALYSIS VECTOR FROM p825 
(p829) 

An in vitro analysis vector was made from transgenic vector p825 
5 (possessing an HSA minigene with introns 1-6, Minghetti codon for amino acid 
#403) by introducing an SV40 enhancer into the Asp 718 site within the BLG 
promoter as previously described for the construction of in vitro vector p656 
from transgenic vector p652. This new vector was designated p829. 

1 o In vitro analysis , as previously described, demonstrated that ivector 

p829 supports the expression and secretion of immunoprecipitable HSA from 
transfected mammalian cell line COS-7 cells. 



15 Example 13 

CONSTRUCTION OF BLG VECTORS WITH AN HSA MINIGENE CONTAINING 
INTRONS12-14 WITH EITHER LEU OR PHE CODON FOR HSA AMINO ACID 
#403 (p688, p824) 

20 

A transgenic construct containing an HSA minigene with introns 12-14 
was made as follows: Construct p674 was digested with Nco I (within HSA 
exon 7) and Pvu I (within pGEM). The large DNA fragment generated, 
comprised of the HSA sequences downstream of the Nco I site, including HSA 

25 introns 12-14, and the SV40 polyadenylation site was gel and eiutip purified. 
Construct p572 was similarly digested with Nco I and Pvu I and the large 
fragment generated comprised of the BLG promoter and HSA sequences 
(cDNA) upstream of the Nco I site was gel and eiutip purified. The two purified 
fragments were ligated together and introduced into E. coli DH10B cells to 

30 confer ampicillin resistance. Correct recombinants were identified by the 

generation of 3 DNA fragments (approximately 4767, 4177 and 1700 bp) upon 
digestion with BamH I and 3 DNA fragments (approximately 7028, 2757 and 
859 bp) upon digestion with Xba I. This new transgenic vector possessing an 
HSA minigene with introns 12-14 was designated p688. 

35 

Transgenic vector p688 containing an HSA minigene with introns 12-14 
was modified so that it encodes the Minghetti amino acid sequence at position 
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#403. The 549 bp Nco I to Avr II DNA fragment obtained from construct p822 
was used to replace the corresponding fragment in construct p688 (similarly 
digested with Nco I and Avr II) as previously described for the construction of 
vector p823. This new transgenic construct was designated p824. 

5 

Example 14 

A. CONSTRUCTION OF BLG VECTORS WITH AN HSA MINIGENE 
CONTAINING INTRONS 1 and 2 and 12-14 WITH EITHER LEU OR PHE 
1 0 CODON FOR HSA AMINO ACID #403 (p81 2, p826) 

A transgenic construct containing an HSA minigene with Introns 
1+2+12-14 was made as follows: Construct p674 was digested with Nco I 
(within HSA exon 7) and Pvu I (within pGEM). The large DNA fragment 

1 5 generated, comprised of the HSA sequences downstream of the Nco I site, 
including HSA introns 12-14, and the SV40 polyadenylation site was gel and 
elutip purified. Construct p607 was similarly digested with Nco I and Pvu I and 
the large fragment generated comprised of the BLG promoter and HSA 
sequences upstream of the Nco I site, including HSA introns 1 and 2, was gel 

20 and elutip purified. The two purified fragments were ligated together and 
introduced into E. coli DH5 cells to confer ampicillin resistance. Correct 
recombinants were identified by the generation of 3 DNA fragments 
(approximately 8900, 2718 and 854 bp) upon digestion with Xba I. This new 
transgenic vector possessing an HSA minigene with introns 1+2+12-14 was 

25 designated p812. 

A transgenic construct containing an HSA minigene with introns 
1+2+12-14 with the Minghetti codon for HSA amino acid #403 was made. 
Construct p824 was digested with Nco I (within HSA axon 7) and Pvu I (within 

30 pGEM). The large DNA fragment generated, comprised of the HSA sequences 
downstream of the Nco I site (including HSA introns 12-14 and Minghetti codon 
for #403) and the SV40 polyadenylation site was gel and elutip purified. 
Construct p607 was similarly digested with Nco I and Pvu I and the large 
fragment generated comprised of the BLG promoter and HSA sequences 

35 upstream of the Nco I site, including HSA introns i and 2, was gel and elutip 
purified. The two purified fragments were ligated together and introduced into 
E. coli DH5 cells to confer ampicillin resistance. Correct recombinants were 
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identified by the generation of 3 DNA fragments (approximately 8900, 2718 
and 854 bp) upon digestion with Xba I. This new transgenic vector was 
designated p826. 

5 B. CONSTRUCTION OF IN VITRO ANALYSIS VECTORS FROM p81 2 AND 
p826 (p813, p830) 

An in vitro analysis vector was made from transgenic vector p812 
(possessing an HSA minigene with introns 1+2+12-14) by introducing an 
10 SV40 enhancer into the Asp 718 site within the BLG promoter as previously 
described for the construction of in vitro vector p656 from transgenic vector 
p652. This new vector was designated p81 3. 

In vitro analysis , as previously described, demonstrated that vector 
15 p813 supported the expression and secretion of immunoprecipitable HSA from 
transfected mammalian cell line COS-7 cells. 

An in vitro analysis vector was made from transgenic vector p826 
(possessing an HSA minigene with introns 1+2+12-14, Minghetti codon for 
20 amino acid #403) by introducing an SV40 enhancer into the Asp 718 site 

within the BLG promoter as previously described for the construction of in vitro 
vector p656 from transgenic vector p652. This new in vitro analysis vector was 
designated p830. 



25 In vitro analysis , as previously described, demonstrated that vector 

p830 supports the expression and secretion of immunoprecipitable HSA from 
transfected mammalian cell line COS-7 cells. 



30 



Example 15 

IN VITRO (TISSUE CULTURE) ANALYSIS OF BLG/HSA VECTORS 



In order to determine that all of the BLG/HSA vectors introduced into 
transgenic animals had the potential ability to support the expression of HSA in 
35 the milk of such animals, the ability to support expression of HSA in tissue 
culture cells was first tested. The natural in YiYQ regulation of expression of 
milk proteins under the control of their native promoters (e.g.,. BLG) is complex 
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and requires the influence of hormones and specific cell-cell interactions. The 
BLG 5-flanking promoter sequences are not usually active in tissue culture 
cells and tissue culture systems which precisely mimic the natural in vivo 
conditions did not exist. 

5 

In order to stimulate these sequences into activity an SV40 enhancer 
was introduced within the promoter. This allowed the testing of the levels of 
expression of HSA in tissue culture supported by BLG/HSA constructs which 
differ in their HSA gene makeup (cDNA, minigenes, gene). In order to keep the 

1 0 genetic background the same in these in vitro analysis constructs, a series of 
constructs were made which differed only in the HSA gene components. An 
SV40 enhancer was first introduced into transgenic construct p652 (3 kb BLG 
5-flanking promoter sequences; HSA minigene with introns 1-6; SV40 poly A 
site). The resultant in yilTQ construct was then used to make all other constructs 

1 5 of this series so that while the HSA sequences varied, the BLG promoter with 
introduced SV40 enhancer was identical in all. In addition all possessed the 
same SV40 poly A site downstream of the HSA sequences. 

Construct p652 was digested with Asp 718 (within the BLG promoter, 

20 approximately 900 bp upstream of the BLG transcriptional start site). The 
linearized construct was extracted two times with phenol/chloroform and 
ethanol precipitated. The Asp 718 digested ends were blunted by filling in with 
Klenow enzyme (Boehringer Mannheim) in the presence of excess dNTPs. 
Following the fill in reaction, the enzyme was heat inactivated (65° C, 15') in 

25 the presence of 10 mM EDTA. The sample was again extracted two times and 
ethanol precipitated. The linearized and blunted DNA was gel and elutip 
purified and it's 5'-ends dephosphorylated with calf intestinal alkaline 
phosphatase (Promega). The enzyme was heat inactivated (65° C, 15') in the 
presence of 20 mM EGTA. The sample was again extracted two times and 

30 ethanol precipitated. The SV40 enhancer was released from construct 

PSV2CAT (Gorman et aL . 1982, Mol. Cell. Biol. 2, 1044-1051) as a 179 bp Fok 
I ( cleaves at SV40 bp position 94) to Pvu II (cleaves at SV40 bp position 273) 
fragment. The Fok I site was blunt ended with Klenow polymerase and excess 
dNTPs. The fragment was gel and elutip purified and subsequently ligated to 

35 the prepared p652 fragment. Ligation products were transformed into E.coli 
TG1 cells to ampicillin resistance. Correct recombinants were identified by 
digestion with BamH I and EcoR I whose sites flank the Asp 718 site of p652. 
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The conversion of the approximate 2088 bp fragment to approximately 2267 bp 
identified correct recombinants with an SV40 enhancer introduced into the Asp 
718 site of p652. These were designated p656. (Fig. 2C) Jn Yjim analysis 
construct, p656 is capable of expression of the HSA minigene with introns 1-6 
5 in tissue culture cells. 

Several new in vitro analysis constructs were made directly using inyjlrjQ 
analysis construct p656. Construct p656 was digested with BstEII (within HSA 
exon 1) and Nco I (within HSA exon 7). The large DNA fragment deleted of 

1 0 HSA sequences between these two sites was gel and elutip purified. DNA 
fragments of HSA sequences between the BstEII site in exon 1 and the Nco I 
site in exon 7 made up of cDNA (801 bp) or which included intron 1 (1510 bp) 
or intron 2 (2255 bp) or introns 1 and 2 (2964 bp) were derived from p582 
(containing HSA cDNA, discussed below), p600 (containing HSA minigene 

15 with intron 1), p690 (containing HSA minigene with intron 2) or p607 

(containing HSA minigene with introns 1 and 2), respectively, by digestion with 
BstEII and Nco I. Fragments were gel and elutip purified and individually 
iigated into the purified p656 fragment lacking sequences between these two 
sites. Ligation products were introduced into Ecoli DH5 alpha ceils by 

20 transformation or E.coli D H10B cells by electroporation to ampicillin resistance. 
Correct recombinants were identified by the generation of DNA fragments of 
801 , or 1510, or 2255, or 2964 bp, respectively, upon digestion with BstEII and 
Nco I. The new in vitro analysis construct containing HSA cDNA was 
designated p658. The constructs containing HSA minigene with intron 1 was 

25 designated p659, with intron 2, p691 , and with introns 1 and 2, p660. 

in addition to making a BLG/HSA in vitro analysis construct with the HSA 
cDNA, we also made an in vitro analysis construct with the HSA cDNA under 
the control of the highly active Adenovirus major late promoter and SV40 

30 enhancer combination. This allowed the evaluation of HSA expression from its 
cDNA in a construct other than a BLG construct. This construction was made in 
two steps. In the first step the SV40 early region small t splicing signals and 
poly A site was placed downstream of a polylinker, which itself is downstream 
of the major late promoter. An in yjlm analysis construct (referred to here as 

35 p550) made up of the major late promoter with an SV40 enhancer introduced 
at its EcoRV site and followed by a polylinker (Hurwitz et al.. 1987, Nucl. Acids 
Res. 15, 7137-7153.) was digested with BamH I (within the polylinker). The 
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linearized DNA ( 281 8 bp) was gel and elutip purified. It's 5'-ends were 
dephosphorylated with calf intestinal alkaline phosphatase. The enzyme was 
heat inactivated as previously described, extracted with phenol/chloroform two 
times and ethanol precipitated. This fragment was ligated to an approximately 
5 850 bp fragment (Bgl II at the upstream end and BamH I at the downstream 
end) consisting of SV40 small t splicing signals and a downstream poly A site 
(Mulligan and Berg, Science 209:1423-1427, 1980). Ligation products were 
introduced into E.coli DH5 cells by transformation to ampicillin resistance. 
Correct recombinants with the splicing signals and poly A site within the BamH 

1 0 I site of p550 in the same orientation as the major late promoter were 

characterized by the generation of two fragments (approximately 2809 and 856 
bp) upon digestion with BamH I and EcoR I. These were designated p566. 
The HSA cDNA was then introduced into p566. Construct p566 was digested 
with the blunt cutter Nae I and EcoR I just downstream of Nae I (both sites are 

1 5 within the polylinker between the major late promoter and the downstream 
SV40 splicing signals and poly A site). The HSA cDNA was obtained as 
follows. Construct pHSA-F1" was digested with BamH I (at the 5'-end of the 
HSA cDNA) and ethanol precipitated. The digested BamH I site was blunted 
with Klenow enzyme in the presence of excess dNTPs. The DNA was ethanol 

20 precipitated and digested with EcoR I (at the 3'-end of the HSA cDNA). The 
resultant cDNA fragment (1983 bp) was gel and elutip purified. It was then 
ligated into the prepared p566 DNA and ligation products introduced into DH5 
cells by transformation to ampicillin resistance. Correct recombinants were 
characterized by the restoration of the unique EcoR I site and the generation of 

25 fragments (approximately 4208, 1399 and 36 bp) upon digestion with Bgl II. 
This inyillQ analysis construct with the HSA cDNA under the control of the 
major late promoter was designated p582. 

HSA introns 12-14 were introduced into p658 as follows. Construct 
30 p658 was digested with Nco I (within HSA exon 7) and partially with Sal I (at 
the downstream end of the SV40 poly A site; a second Sal I site is found at the 
upstream end of the BLG promoter). The DNA fragment of approximately 6000 
bp deleted of HSA sequences downstream of the Nco I site In exon 7 and the 
SV40 poly A site was gel and elutip purified. Construct p674 was digested with 
35 Nco I (within HSA exon 7) and Sal I (at the downstream end of the adjacent 
SV40 poly A site). The DNA fragment of approximately 4000 bp made up of 
HSA sequences downstream of the Nco I site in exon 7, including exons 12-14, 
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and the poly A site was gel and elutip purified. These two fragments were 
ligated together and products were introduced into E.coli MC1061 cells by 
electroporation to ampicillin resistance. Correct recombinants were 
characterized by 2 DNA fragments (approximately 8021 and 2824 bp) upon 
5 digestion with Sal I and 3 fragments (approximately 7229, 2757 and 859 bp) 
upon digestion with Xba I. This newinyjiEQ analysis construct i _designated 
p682 A contains an HSA minigene with introns 12-1 4. 

A construct containing an HSA minigene with introns 7-14 was made by 

10 first digesting p658 (containing an HSA cDNA) with Nco I (within HSA exon 7) 
and Pvu I (within pGEM). The large DNA fragment (approximately 5569 bp) 
made up of the BLG promoter with introduced SV40 enhancer and HSA cDNA 
to the Nco I site in exon 7, as well as pGEM sequences to the Pvu I site 
adjacent to BLG sequences was gel and elutip purified. Construct p683 was 

15 also digested with Nco I (within HSA exon 7) and Pvu I (within pGEM). The 
DNA fragment (approximately 10400 bp) made up of HSA sequences 
downstream of the Nco I site, including introns 7-14, the SV40 poly A site and 
adjacent pGEM sequences (complementary to those found in the fragment 
above) was gel and elutip purified. The two purified fragments were ligated 

20 together and ligation products were introduced into E.coli DH5 alpha cells by 
electroporation to ampicillin resistance. Correct recombinants were identified 
by the generation of 2 fragments each upon digestion with BamH I 
(approximately 11998 and 4185 bp) or Hind III (approximately 9973 and 6210 
bp). This new in vitro analysis construct with an HSA minigene with introns 7- 

25 14 was designated p684. 

HSA introns 1 or 2 or 1 and 2 were introduced into the new inyjtia 
analysis constructs already containing HSA minigenes with either introns 12- 
14 (p682) or introns 7-14 (p684). Constructs p682 and p684 were each 

30 digested with BstEII (within HSA exon 1 ) and Nco I (within HSA exon 7) and 
the resultant large DNA fragments deleted of HSA sequences between these 
two sites were gel and elutip purified. Into each were individually ligated the 
purified DNA fragments, discussed above. (1510, or 2255 or 2964 bp) of HSA 
sequences between the BstEII and Nco I sites including intron 1 or intron 2 or 

35 introns 1 and 2, respectively. Ligation products were either introduced into 
E.coli DH5 cells by transformation or Ej;oJLDH10B cells by electroporation. 
Correct recombinants were identified for each set of parental clones, p682 and 
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p684, by the generation of 1510 bp (for the introduction of intron 1) 2255 bp (for 
the introduction of intron 2) or 2964 bp (for the introduction of introns 1 and 2) 
DNA fragments upon digestion with BstEII and Nco I. The designations of the 
new in vitro analysis constructs and their specific HSA minigene structure is 
5 listed below. 

p694 HSA minigene with introns 1+12-14 . 

p695 " - " - 2 + 12-14 

P697 " " " - 1+2 + 12-14 

10 p693 " " " " 1+7-14 

p692 " • " 2 + 7-14 

p698 " " " 1+2 + 7r14 

An inyilm analysis construct containing the entire HSA gene with all 14 
15 introns was made as follows. Construct p656 was digested with Nco I (within 
HSA exon 7) and Pvu I (within pGEM). The DNA fragment (approximately 
12326 bp) made up of pGEM sequences (from the Pvu I site), adjacent BLG 
promoter with introduced SV40 enhancer and the HSA sequences to the Nco I 
site within exon 7 including introns 1-6 was gel and elutip purified. To this 
20 fragment was ligated the purified p683 Nco I to Pvu I fragment (approximately 
10400 bp), discussed above, made up of HSA sequences downstream of the 
Nco I site in exon 7, including introns 7-14, the SV40 poly A site and adjacent 
pGEM sequences to the Pvu I site. Ligation products were introduced into 
E.coli DH10B cells by electroporation to ampicillin resistance. Correct 
25 recombinants were characterized by the generation of two fragments 

(approximately 18929 and 4186 bp) upon digestion with BamH I. This in vitro 
analysis construct containing the entire HSA gene with introns 1-14 was 
designated p685. 

30 The in yjtta tissue culture expression was accomplished by transient 

transfection. Tissue culture mammalian cell line COS-7 cells were split equally 
into 100 mm tissue culture dishes in DMEM medium plus 10% fetal calf serum 
(FCS) so that they were approximately 50-75% confluent (approximately 5x1 0 6 
cells). They were incubated overnight at 37 °C in a CO2 incubator. The next 

35 morning the medium was replaced with 5 ml of fresh medium and cells 

incubated for 1-2 hours. They were then transfected with BLG/HSA constructs 
(which included the SV40 enhancer) using the calcium phosphate technique 
(reagents supplied by 5 Prime -» 3 Prime, Inc.) by the supplier's protocol. 
Within each experiment the total amount of the largest construct (kb), for that 
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experiment, transfected into cells within a plate was 25 ng . In order to transfect 
equal molar amounts of smaller constructs, the amounts of each of these 
constructs were reduced proportionally to their size differences to the largest 
construct and the total amount of DNA for the each construct was brought up to 
5 25 ug using high molecular weight (HMW) salmon sperm (ss) DNA. Following 
transfection of cells for 4-5 hours, cells were glycerol shocked (3 ml, 2 minutes) 
and washed according to supplier's protocol and subsequently incubated in 10 
or 15 ml of DMEM medium plus 10% FCS for 3 days. 

1 0 In order to detect the transient expression and secretion of HSA, 

transfected cells were starved for amino acids cysteine (Cys) and methionine 
(Met) by first washing with and then incubating cells in DMEM medium (plus 
glutamine) lacking Cys and Met plus 5% dialyzed FCS (dFCS) for 1-3 hours. 
Following removal of medium from cells, cells (and da novo synthesized 

15 proteins) were metabolically labeled with 3 ml DMEM (plus glutamine, without 
Cys or Met, plus 10% dialyzed FCS) containing 35 S-Cys and 35 S-Met 
(Expre 35 S 35 S 35 S-Protein labeling mix; New England Nuclear, Inc.) at 
approximately 200 uCi/ml for 4-5 hours. 

20 After metabolic labeling, the supematants were harvested from dishes 

and centrifuged to remove any contaminating cells. Metabolically labeled HSA 
expressed and secreted into supematants was detected by immuno- 
precipitation using rabbit anti-HSA antibodies (DAKO - immunoglobulins Cat 
#A001). Supematants were first precleared with 200 ul of 50% slurry of protein 

25 A-Sepharose beads in immunoprecipitation (IPP) buffer (20 mM Tris, pH 8, 150 
mM NaCI, 1% NP-40, 0.1% SDS, 2 ug /ml aprotinin) at 4 °C for 30-60 minutes 
with rocking. Cleared supematants were separated from beads by 
centrifugation and treated with rabbit anti-HSA IgG prebound to protein A- 
Sepharose beads, at 4 °C for 3-4 hours. Beads were washed 6 times with cold 

30 IPP buffer, resuspended in 2 X SDS-PAGE Laemmli sample buffer, heated to 
95 °C for 5 minutes and run on 8% SDS-PAGE gels. Following 
electrophoresis, gels were fixed (10% acetic acid, 25% isopropanol), treated 
with the fluorographic reagent Amplify (Amersham, Inc.), dried onto Whatman 
3MM paper and used to expose X-ray film. Developed films (autoradiographs) 

35 allowed visualization of the relative levels of expression and secretion of 
metabolically labeled HSA from each of the tissue culture transient assay 
plates supported by each of the analyzed BLG/HSA constructs. 
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The first in yilEQ analysis demonstrated that HSA can be expressed from 
constructs containing the HSA cDNA within a BLG construct containing the 
BLG coding as well as BLG 3'-sequences and poiyadenylation poly A site 
5 (p615), an HSA minigene containing intron 1 within constructs lacking the BLG 
coding sequences but which possess either the BLG poly A site (p606), or the 
SV40 poly A site (p608), or an HSA minigene with introns 1 and 2 within a 
construct with the BLG poly A site (p610). The HSA produced from these 
vectors comigrate with HSA produced from a non-BLG transient vector with the 

1 0 HSA cDNA under the control of an SV40 enhancer/Adenovirus major late 
promoter (p582). As expected BLG constructs which lack the SV40 enhancer 
(p600) do not express HSA in vitro. The immunoprecipitation of the band seen 
in this in yjicQ analysis is specific to the anti-HSA serum and is not precipitated 
with a non-specific antiserum demonstrating that the band is in fact HSA. 

1 5 Significantly, the levels of expression increase with the increase in number of 
introns with the cDNA being expressed least and the HSA minigene with 
introns 1 and 2 expressed to the highest level in this group. The levels of 
expression from p606 and p608 (HSA minigenes with intron 1) are equivalent 
indicating that the origin (SV40 or BLG) of the 3'-poly A site does not affect 

20 levels of expression in this assay. 

This previous analysis was performed on constructs which varied in 
components in addition to the HSA intron variations. In order to specifically 
analyze the effect of HSA intron number and position on levels of expression in 

25 this in vitro assay, constructs which vary only in HSA introns (Fig. 3A) were 
tested . Ail of these constructs contain the same BLG 5-flanking sequences 
(promoter) with introduced SV40 enhancer as well as the same 3'-sequences 
(SV40 poly A site). A wide range of levels of expression are obtained from 
constructs with different HSA minigenes. As before, the very low level of HSA 

30 expression with the HSA cDNA (lane 1 ) is increased with the HSA minigene 
with intron 1 (lane 2) and further increased with introns 1 and 2 (lane 3). 
Inclusion of intron 2 alone (lane 9) has a similar effect as both introns 1 and 2 
(lane 3) and clearly intron 2 is more efficacious than intron 1 alone (lane 2). 
This finding demonstrates that specific introns are more effective than others in 

35 providing expression of HSA. In addition, simply increasing the number of 
introns does not necessarily further increase expression as seen by the fact 
that the presence of the last 3 HSA introns, 12-14 (lane 6) resufts in a lower 
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level of expression than intron 2 alone (lane 9) but a similar level as with intron 
1 alone (lane 2). This may be due to the nature of the specific introns and/or to 
their relative positions within the gene, with introns 12-14 at the 3 -end rather 
than the 5'-end. Significantly, similar and higher levels of expression are 
5 obtained with constructs containing either HSA minigenes containing the first 6 
introns (introns 1-6, lane 4), the last 8 Introns (introns 7-14, lane 7) or the entire 
HSA gene with all of its introns, 1-14 (lane 8). While the specific reasons for 
the increased and similar levels of expression obtained when using introns 1-6 
or 7-1 4 are unknown, it is clear that the inclusion of either of these subsets 

1 0 results in expression as high as the inclusion of all HSA introns. The extremely 
high level of expression obtained with an HSA minigene containing introns 2 
and 7-14 (lane 10) demonstrates the synergistic effects of specific intron 
combinations on levels of expression, and that expression of HSA can be 
increased several fold by incorporating these specific intron combinations as 

1 5 opposed to the inclusion of the entire gene with all of its introns. 

The synergistic effects of specific HSA intron combinations on levels of 
expression of HSA (Fig. 3B) were investigated. The same relative levels of 
expression from constructs previously discussed, including the synergistic 

20 effect of introns 2 and 7-1 4 (lane 1 4) where expression is extremely high and 
much higher than what would result from an additive effect of intron 2 (lane 3) 
and introns 7-14 (lane 12). The combination of introns 1 and 7-14 (lane 13) 
was not synergistic since the level of expression supported by this construct is 
about the same as that supported by the construct with only introns 7-14 (lane 

25 12). Additional synergistic combinations were also demonstrated. While the 
levels of expression from constructs with either HSA introns 1 (lane 2) or 
introns 12-14 (lane 8) are very low, introns 1 and 12-14 (lane 9) result in 
significantly higher levels than either alone or the additive effect of both 
together. The levels of expression due to the synergy between introns 2 and 

30 12-14 (lane 10) were even higher. An even greater three part synergy 
involving introns 1 and 2 and 12-14 (lane 11) demonstrating levels of 
expression higher than that expected from an additive effect of the three alone 
or the additive effects of 1 with 12-14 and 2 or 2 with 12-1 4 and 1 . The 
resultant level of expression with introns 1 and 2 and 12-14 was higher than 

35 with introns 1-6 (lane 6) or introns 7-14 (lane 12) or the entire gene with introns 
1-14 (lane 5). The highest level of expression in these experiments was 
supported by a construct with HSA introns 2 and 7-1 4 (lane 14), similar to 
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introns 1 and 2 and 7-14 (lane 15). This was several fold higher than that 
supported by other constructs tested including one with the entire HSA gene 
with all 14 of its introns. 

5 These results demonstrate that in living cells the level of expression of 

HSA is modulated by the specific complement of HSA introns, i.e.., the number 
of HSA introns present in the construct, the specific introns incorporated; the 
relative location of introns, and the synergies between specific introns. 
Several fold higher levels of expression are obtained with constructs 
10 containing HSA minigenes with specific subsets of introns as compared with 
the entire HSA gene with all of its introns or with HSA cDNA. 

ExamplQ 1R 

15 

GENERATION AND IDENTIFICATION OF TRANSGENIC MICE 

A. Collection of Fertilized Eggs 

20 Mice used for the collection of fertilized eggs are the inbred line EBV/N, 

established at the NIH (Proc. Natl. Acad. Sci. USA 88:2065-2069, 1991). They 
were obtained from the National Institute of Health Animal Genetic Resource. 
To induce superovulation, 5-6 week females are injected with 5 i.u. of PMSG 
(Intervet), followed 44-48 hours later by injection of 5 i.u. of Human Chorionic 

25 Gonadotropin (HCG) (Sigma Chemical Company). The females are then 
mated with mature FBV/N males. The following morning, mated females are 
identified by the presence of vaginal plug. The flushing of fertilized eggs from 
the oviduct, treatment with hyaluronidase and culture conditions in M16 or M2 
media are performed as described by Hogan, Costantini and Lacy 

30 "Manipulating and Mouse embryo, A laboratory manual" Cold Spring Harbor 
Laboratory (1 986). 

B. Preparation of DNA for Microinjection 

35 To purify DNA sequences for microinjection, plasmids carrying the BLG 

or BLG/HSA genes were digested with Sal I. BLG and BLG/HSA sequences 
were separated from pGEM sequences by separation on 1.5% agarose gels, 
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electroelution and purification on elutip column (Schleicher & Schuell). DNA 
was suspended in 10 mM Tris pH 7.5 containing 0.5 mM EDTA at a 
concentration of 3 \ig /ml and microinjected into the pronuclei of FBV/N eggs, 
which were subsequently implanted into the oviducts of CDI pseudopregnant 
5 recipient mice as described by Shani (Shani, 1985, Nature 314283-286; 
Shani, 1986, Mol. Cell Biol. 6:2624-2631). 

C. Microinjection 

1 0 injection pipettes are made from 10 cm long, 1 .0 mm outside diameter, 

thin wall, borosillicate glass capillaries with filament (Cat. No. TW100F-4; World 
Precision Instruments, Inc. 375 Quinniplac Ave., New Haven Conn. 06513, 
USA). The holding pipettes are prepared from 9.0 cm long, 1.0 mm outside 
diameter glass capillaries (Cat. No. 105G; Drummond Scientific Co. 500 Pkwy., 

15 Broomall, PA 19008, USA), as described by Hogan, Costantini and Lacy 
"Manipulating the mouse embryo: A laboratory manual" CSHL (1 986). 

Microinjection is carried out in a drop of M42 medium overlaid with 
Silicone oil (Cat No. 6428-R20; Thomas Scientific, P.O. Box 99, Swedesboro, 
20 NJ 08085-0099, USA), in a glass microscope slide chamber. The chamber is 
mounted on the microscope (Diaphot, Nikon) equipped with x20 and x40 
differential interferences contrast (DIC) objectives and x1 0 eyepieces. 3D 
Hydraulic micromanipulators (Cat. No. MN-188, Nikon) are mounted on the 
stage of the microscope. 

25 

DNA (about 1ul) is introduced into the injection pipette at the broad side 
and it is carried to the tip by capillary action along the inner filament. The 
injection capillary is filled up with Flurinet FC77 (Cat. No. F4758; Sigma 
Chemical Company) and mounted onto the micromanipulator via the 
30 instrument collar (Cat No. 070 321 ; Bunton Instrument Co. Inc. Rockville, MD 
20850, USA), which is connected to the hydraulic drive unit (HDU; Bunton 
Instrument Co. Inc.; Rockville, MD 20850. USA) by a tubing (PE-100; Bunton 
Instrument Co. Inc.; Rockville, MD 20850, USA). The holding capillary is 
similarly mounted. The entire set up is filled with Flurinet FC77 (Sigma). 

35 

Batches of 20-30 pronuclear stage eggs are placed in the injection 
chamber. The holding and injection pipettes are brought to the chamber. 
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While the holding pipette picks up the egg, the injection pipette is inserted into 
the pronucleus and about 2 pi of the DNA solution is injected. When all the 
eggs in the chamber are injected, they are harvested and cultured for at least 1 
hour before implantation. 

5 

D. Embryo Transfer 

For routine embryo transfer outbred CD1 females mated with 
vasectomized CD1 males are used. Between 10-15 microinjected eggs are 
1 0 transferred to each oviduct, essentially as described by Hogan, Costantini and 
Lacy "Manipulating the mouse embryo: A laboratory manuaT CSHL (1 986). 

E. Identification of Transgenic Mice 

1 5 Transgenic animals were identified by tail biopsies (2 cm) taken 3 weeks 

after birth. Biopsies were incubated in 1 ml of 50 mM Tris pH 8.0 containing 
0.5% SDS, 0.1 M EDTA and 200 jig proteinase K overnight at 55°C. Genomic 
DNA was purified from the homogenates by extraction with phenol/chloroform. 
Approximately 1 0 mg of DNA from each sample was digested with BamHI, 

20 fractionated on 0.8% agarose gel and transferred to Gene Screen filters (Du 
Pont). Hybridization was performed at 42°C in 50% formamide, with probe 
made from the insert of plasmid p598, 32P-CTP labeled using random primed 
DNA labeling kit (Boehringer Mannheim). Filters were washed with 0.2 x SSC 
containing 1% SDS at 60°C, and exposed to Kodak XAR-5 film at -80°C. (Fig. 

25 4) Lanes are the analysis of DNA from transgenics #9 through #23 (followed 
by-a blank (one) #25 and #26. 

Example 17 

30 ANALYSIS OF MAMMARY GLAND EXPRESSION 

A. Collection and fractionation of milk. 

Milk was collected from nursing transgenic mice 10-12 days after 
35 parturition. Three hours after mothers were separated from their pups they 
were injected intraperitonealy with 0.3 IU oxytocin (Sigma). Milk was collected 
10 minutes later by gentle massage of the mammary gland and taken up in a 
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capillary tube (Clark et al... 1 987, Trends RintPrhnnloav £20-24). Milk 
samples were diluted 1 :5 in water containing 2 mM PMSF and Aprotinin 
(Sigma) and defatted by centrifugation. To prepare whey, the caseins were 
first precipitated by addition of 1 M HCI to pH 4.5. Whey proteins were 
5 subsequently precipitated in 10% trichloroacetic acid (TCA), washed with 
acetone and solubilized in SDS polyacrylamide gel electrophoresis (PAGE) 
sample buffer. 

B. Milk protein analysis. 

10 

Milk proteins were analyzed for the presence of either sheep BLG or 
HSA. Diluted (1:5) and defatted milk collected from lactating G 0 or Gi females 
from the seven transgenic lines (transgenic strain #30,35,37,38,39,40 and 41) 
generated from vector p585 were analyzed for the presence of sheep BLG by 

1 5 immuno-dot blot using rabbit anti-bovine BLG antibodies and iodinated protein 
A (Fig. 5A). The amount of material spotted on the nitrocellulose filters is 
indicated as well as the transgenic strain # from which the milk sample was 
obtained. C indicates control mouse milk. S indicates sheep milk sample. 
BLG indicates purified BLG protein. All seven transgenic lines expressed 

20 sheep BLG in their milk (Fig. 5A and Table 1 ). Expressed levels, ranging from 
about 1.0 mg/ml (lines #37 and #41) to about 8.5 mg/ml (lines #30, 35, and 39) 
were estimated from the intensity of the immuno-dot blot signals as compared 
with BLG standards and corrected for the dilution factor. As expected no signal 
was detected with control mouse milk which does not naturally contain BLG. In 

25 order to determine if levels of expression of BLG could be increased by 
increasing the length of 5-sequences flanking the BLG transcription unit 
transgenic mice were produced from vectors p644 possessing approximately 
5.5 kb of this region. Milk samples from two resultant transgenic lines, 46 and 
48, were analyzed and found to express BLG at levels within the same range 

30 as obtained from transgenics produced from vector p585. Therefore, it appears 
that increasing the 5-flanking region, containing regulatory sequences, from 3 
kb (p585) to 5.5 kb (p644) or to 10.8 kb (p646) did not increase levels of 
expression of BLG. 

35 For the detection of BLG, whey samples were fractionated on 15% SDS 

polyacrylamide gels. Proteins were either stained with Coomassie brilliant 
blue or transferred onto nitrocellulose filter in a Bio Rad trans-blot cell (Bio 
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Rad). Filters were blocked with TBS (20 mM TFUS/100 mM NaCI) containing 
2% Bovine Serum Albumin (BSA, Sigma) and subsequently reacted for 2 
hours with rabbit anti-BLG antiserum (Nordic Immunological Laboratories, 
Capistrano Beach, CA). The complex was incubated with goat anti-rabbit IgG 
5 (Bio Makor, Nes Ziona Israel) and then with rabbit peroxidase anti-peroxidase 
(PAP, Bio Makor). Peroxidase activity was revealed using diaminobenzidine 
as substrate. Alternatively, sheep BLG was detected using 125|-protein A 
following the incubation with anti-BLG antiserum. 

1 0 The whey fraction of milk obtained from the two highest expressing lines 

(30 and 35) were further analyzed by SDS polyacrylamide gel electrophoresis 
(SDS-PAGE) and immunoblot using anti-BLG antiserum. An immunoreactive 
band of approximately 18 kd was detected co-migrating with purified bovine 
BLG and native BLG in sheep milk (Fig. 5B) thus verifying the expression of 

1 5 authentic BLG in the milk of the transgenic lines. The amounts (mg or ml) of 
material loaded on the gel shown on Fig. 5B is indicated as well as the strain 
number from which the milk sample was obtained. C indicates control mouse 
milk. S indicates sheep milk sample. BLG indicates purified BLG protein. 
These results indicated that the basic transgenic vector with 3 kb of 5'-flanking 

20 sequences contained sufficient information to target high level expression of 
protein to the mammary gland of transgenic mice. A summary of results is 
shown in Table 2. 
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Table 2 



Vector 


strain 


FYPPFSSIONOFBLG 
Fxnression* XSStoL 


Strain 


Exnression* 




D585 


30 


8.4 


D644 


■to 


UD 




35 


8.5 




44 


UQ 




37 


1.0 




46 


2J. 




38 


6.0 




48 


±2 




39 


8.3 




49 


UD 




40 


4.7 










41 


1.0 


D646 


52 


1.0-2.0 










54 


1.0-2.0 










56 


UD 


Vector 


Strain 


FXPRFSSIONOFHSA 
Expression* YeCtPl 


Strain 


Expression* 


D575 


1-8 


UD 


D600 


9.11.12.14. 


UD 










16.17 


UD 




D598 


15.18.21.25 


UD 


D599 


19.20.22.24. 


UD 




23 


2.5 




26 


UD 




D607 


27.28 


UD 


D652 


61 


-6-10 




31 


0.005 




62 


0.002 




34 


0.001 




63.65.67 


UD 




36 


0.035 




66 


0.002-0.04 




42 


0.002 




69 


1.5 










71,72,73,74 


UD" 










75 


UD 




D643 


45.47 


UD 














D654 


77 


UD 




p647 


50,51,53 


UD 




76,78,79,80 


UD" 




58.64 


UD 










59 


0.002 


p686 


81 
89 


UD 
0.001 








p687 


82 
83 
86 

84. 85 87.88 


0.01 
5 

0.005 
UD 


p812 


104 

103. 105 
106 


0.8-1.0 
not yet 
determined 


p696 


ao 

91 

92 
93 


UD 
2.5 
0.07 
0.002 



* mg/ml ; "less than 0.001 mg/mi; "UD" means less than 0.001 mg/mL 
5 "Not yet determined" means that transgenic has been produced but the presence 

of HSA in the rriDc has not yet been determined. 

"Low lever means that preliminary evaluation by dot blot suggested the presence 
of HSA in the milk of transgenic animals. Subsequent quantitation of the level of 
HSA in the milk reGed on a level of 0.001 mg/ml or greater to be considered 
10 detectable. 
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USA was detected in milk samples fractionated on 7.5% SDS or native 
polyacrylamide gels. Proteins were either stained with Coomassie blue or 
transferred onto nitrocellulose filters. The filters were blocked with 3% gelatin at 
37°C and then reacted overnight at room temperature with iodinated anti-HSA 
5 monoclonal antibodies. After extensive washings with TBS containing 0.5% 
tween, filters were exposed to Kodak XAR-5 film at -80°C. The initial attempt to 
produce transgenic mice expressing HSA in their milk was by introducing the 
HSA cDNA into the 5-untranslated region of the first exon of the BLG gene of 
vector p585, resulting in vector p575 (Fig. 2A). The milks of lactating females 

1 0 from 8 transgenic lines produced from vector p575 were analyzed for the 
presence of HSA by immuno-dot blot using iodinated anti-HSA monoclonal 
antibodies. None of the 8 lines secreted detectable levels of the human protein 
(Tables 1 and 2). It appeared that although the BLG vector was able to drive 
expression of its own BLG gene, it was unable to support the expression of the 

1 5 inserted HSA cDNA. Therefore, a series of vectors was tested in which the 
sheep BLG promoter was fused to HSA minigenes possessing either their first 
or first and second introns within their native sites of the HSA cDNA (Rg. 2A). 
Vector p599 differs from vector p575 only by the presence of HSA intron 1. 
Vector p600 also includes an HSA minigene with intron 1 , but has had the BLG 

20 coding sequences deleted though it maintains the untranslated BLG exon 7 
with its polyadenylation signal and site as well as BLG 3'-flanking sequences. 
In vector p598, with an HSA minigene containing intron 1 . BLG coding 
sequences, exon 7 and 3'-flanking sequences were deleted and replaced with 
an SV40 polyadenylation signal and site. Vector p607 is similar to p600 

25 except that it includes both HSA introns 1 and 2. 

From a total of 16 individual transgenic lines produced from vectors with 
an HSA minigene with intron 1 (p599, p600, p598), only one (#23 from p598) 
expressed detectable levels of HSA in its milk (Rg. 6A and Table 2). The top 

30 row represents the spotting of the indicated amounts (ng) of commercially 
purified HSA (Sigma). The middle and bottom rows represent the spotting of 
milk samples from the indicated transgenic mouse strains, control mouse (C), 
human milk (H) and sheep milk(S). The milk from line 23 was estimated to 
contain about 2,000-3,000 ng /ml HSA as determined by comparison of its 

35 signal with HSA standards in the immuno-dot blot (Fig. 6B). The top row 
represents the spotting of the indicated amounts(ng) of purified HSA. The 
middle and bottom rows represent the spotting of indicated amounts of milk 
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samples from indicated transgenic mouse strains, control mouse milk (C) and 
human milk (HM). Significantly, four of the six transgenic lines produced from 
vector p607, containing an HSA minigene with its first 2 introns, expressed 
detectable levels of HSA in their milk, ranging from 1 to 35 u.g /ml (Table 1). 
5 Several transgenic strains generated from vector p652, which contains an HSA 
minigene with introns 1-6, expressed HSA in their milk. One of these strains 
expressed 1.5 mg/ml while a second expressed 6-7 mg/ml. 

Similar levels of HSA expression were supported by constructs which 
1 0 contained the same HSA minigene whether they possessed the SV40 or BLG 
poly A site. 

Milk samples from the 5 expressing lines, as identified by immuno-dot 
assay were subjected to SDS-PAGE and immunoblot (Fig. 6C). In the figure, 

15 HSA represents analysis of commercial HSA. HM represents human milk. 
HSA 23 represents analysis of milk sample from transgenic strain #23. An 
immunoreactive band co-migrating with purified HSA (65 kd) was detected in 
the milk of all immuno-dot positive lines. Densitometry of the autoradiograms 
confirmed the quantitative estimates of HSA based upon the immuno-dot blot 

20 Mouse milk contains a significant amount of endogenous mouse serum 
albumin which co-migrates with human serum albumin in SDS-PAGE gels. 
However, as demonstrated in the immuno-detection assays (Fig. 7A and 7B), 
the anti-HSA monoclonal antibody specifically detected the human protein and 
not the mouse protein. In Fig. 7A and 7B, HM represents human milk, HSA 

25 represents commercially purified HSA (Sigma) and C represents control 
mouse milk. The human and mouse proteins were also distinguishable by 
their distinct electrophoretic mobilities on native polyacrylamide gels. Milk from 
expressing line 23 clearly contains both human (low mobility) and mouse (high 
mobility) albumin as seen by generalized protein staining with Coomassie (Fig. 

30 7A). The lower mobility band was confirmed to be HSA by native gel and 

immunoblot analysis (Fig. 7B). A summary of the expression is shown in Table 
2. 



35 



C. Expression of HSA RNA in different tissues of transgenic mice. . 

In order to examine the tissue specificity of expression of HSA RNA total 
RNA was isolated from various tissues of transgenic female mice on day 10-12 
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of lactation. Total RNA from various tissues of transgenic lactating mice was 
isolated by the UCl/Urea procedure. RNA (10-15 mg) was fractionated on 
MOPS/formaldehyde agarose gels and blotted onto a nylon filter and 
hybridized to a 32 P-labeled anti-sense RNA probe synthesized with the RNA 
5 labeling kit (Boehringer Mannheim), according to the supplier's protocol, using 
HSA cDNA in pSK (Stratagene) plasmid. The HSA probe crosshybridizes to 
endogenous mouse serum albumin mRNA in liver. 

Two patterns of HSA RNA expression were observed as represented by 
1 0 line 23, produced from vector p598, whose milk contains large quantities of 
HSA and line 19, produced from vector p599, whose milk contains no 
detectable HSA (Fig. 8). B represents brain; H represents heart; K represents 
kidney; L represents liver; Lu represents lung; M represents mammary gland; S 
represents spleen; and SK represents skeletal muscle. In line 23 transcripts of 
1 5 the transgene were clearly detected in the mammary gland and to a lesser 
extent in skeletal muscle. No detectable signal was found in the other tissues 
examined, even after a long exposure of the autoradiogram. The HSA 
transgene RNA migrated slightly slower than the endogenous mouse serum 
albumin mRNA (2070 ribonucleotides). This is consistent with an expected 
20 transgenic mRNA size of about 2230 ribonucleotides composed of the 

untranslated portion of BLG exon 1 from its cap site to the site of introduction of 
the HSA minigene, the HSA transcription unit itself minus introns 1 sequences 
removed by splicing, and SV40 sequences upstream of its polyadenylation 
site. 

25 

In mouse line 19, as well as six of the transgenic lines carrying vector 
p575 (all of whose milk contains no HSA), transgene transcripts were not 
detected in the mammary gland. However, significant levels of transcripts were 
found in the kidney. Their higher mobility than the endogenous mouse albumin 
30 mRNA indicates an RNA smaller than the size expected (2783 ribonucleotides) 
of a polycistronic mRNA composed of both HSA and BLG sequences as would 
be produced from vectors p599 and p575. Endogenous mouse serum albumin 
mRNA was also detected in the kidney of control mice. 

35 D. In silu hybridization 
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lasilu hybridization was performed on paraffin sections of mammary 
glands of virgin and lactating transgenics and control mice as described in 
Sassoon, D., Lyons, G., Wright, W., Lin, V., Lassar, A., Weintraub, H. and 
Buckingham, M. 1989, Nature 341:303-307. The probe used was 35s-UTP 
5 labeled antisense RNA synthesized from the HSA cDNA with T7 polymerase. 
(Fig. 9) Panel A shows a section of lactating mammary gland of transgenic 
strain HSA #23. Alveoli are marked "AL". Panel B shows the same section as 
A viewed under dark-field illumination. Panel C shows a section of virgin 
mammary gland of transgenic mouse strain #23; panel D is a dark field view of 
1 0 the same section. A duct is marked "Du". Panel E shows a section of 

mammary gland from a control non-transgenic mouse; and panel F shows the 
same section under dark-field illumination. 

E. Explant studies 

15 

Explant cultures of mammary glands of virgin and lactating mice were 
performed as described by Pittius. CW., Sankaran, L, Topper, YJ., and 
Hennighausen, L 1988. Mol. Endocrinol 2:1027-1032. Briefly, after mincing, 
pieces of approximately 1 mm were cultivated on lens paper floats in serum 

20 free M199 medium. For hormonal stimulation, bovine insulin (0.1 or 5 ug /ml), 
hydrocortisone (0.1 or 5 ug /ml) and ovine prolactin (1 or 5 jig /ml) were added 
to the medium. All hormones were purchased form Sigma (ST. Louis, MO). 
The medium was collected for several days, and then screened for the 
presence of HSA or other milk proteins. (Fig. 10) HSA represents 

25 commercially purified HSA signal. C represents control mouse explants. I 
represents insulin; P represents prolactin; F represents hydrocortisone. First 
set IP, 1FP, IF, I) represent treatments with the lower concentrations of 
hormones. Second set represent treatments with the higher concentration of 
hormones. 

30 

Example 18 

IMMUNOHISTOCHEMICAL DETECTION OF HSA AND B-CASEIN IN 
MAMMARY GLANDS OF TRANSGENIC AND CONTROL MICE 

35 

For immunohistochemical staining, mammary glands were fixed in 4% 
paraformaldehyde at 4°C for 16 hrs. The fixed material was dehydrated in a 
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series of 70-100% ethanols, cleared in chloroform and embedded into paraffin. 
Sections (5 u.M) were cut on a paraffin microtome and mounted on 3- 
aminopropyltrietoxysiiane-treated slides. Mounted sections were 
deparaffinized in xylene (2 changes, 5 min each), processed through a 100- 
5 70% series of ethanols, washed in PBS (5 min) and incubated for 1 0 min in 
absolute methanol containing 0.3% H2O2 for the inhibition of endogenous 
peroxidase activity. The sections were then immersed in PBS for 5 min and 
incubated for 30 min in PBS containing 3% normal goat serum. The sections 
were incubated sequentially with the following reagents: (I) rabbit anti-HSA 

10 antiserum diluted 1 :1000 or rabbit anti-mouse p-casein antiserum diluted 
1 :1 000, or normal rabbit serum diluted 1 :1 000; (2) goat anti-rabbit IgG 
antiserum diluted 1:100; (3) rabbit peroxidase-antiperoxidase (PAP) complex 
diluted 1:500. Each incubation (45 min) and all the dilutions were prepared in 
PBS containing 3% normal goat serum and 3% normal mouse serum. 

1 5 Between treatments the slides were washed with 3 changes (5 min each) of 
PBS. After the final washing the peroxidase activity was revealed by 
incubating the sections in saline buffered with 0.05 M Tris-HCI (pH 7.4) 
containing 0.05 M imidazole, 0.05% diaminobenzidine-HCI and 0.01% H2O2 
for 1-3 min. The sections were briefly washed in distilled water, lightly counter 

20 stained with hematoxylin, dehydrated through ascending ethanols, cleared in 
xylene and coverslipped. 

Figure 1 1 shows the pattern of HSA (A, B) and casein (C, D) 
immunostaining of sections of virgin (A, C) and iactating (B, D) transgenic mice 

25 of strain #23 (which express HSA in their milk). In virgin transgenic glands 
casein staining is concentrated in the luminal surfaces of epithelial cells lining 
the ducts, a pattern similar to that found with control non-transgenics (Figure 
13), while HSA staining is concentrated not only in the apical part of the 
epithelial cells but is also found over the epithelial cell cytoplasm. In Iactating 

30 glands HSA and casein co-localize to the apical parts of the epithelial cells. 

Figure 1 2 shows the pattern of HSA (A, B) and casein (C, D) 
immunostaining of sections of virgin (A, C) and Iactating (B, D) transgenic mice 
of strain #69 (which also express HSA in their milk). In virgin glands both HSA 
35 and casein staining are confined to the luminal surfaces of epithelial cells. 

while in Iactating glands there is a striking difference between the two patterns. 
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In some ducts only a small proportion of the cells are positive for HSA whereas 
casein staining is uniform and all the epithelial cells are stained- 
Figure 13 shows the pattern of HSA (A, B) and casein (C, D) 
5 immunostaining in sections of virgin (A, C) and lactating (B f D) control non- 
transgenic mice. As expected there is no detectable staining with the anti-HSA 
antiserum. The pattern of staining of casein is similar to that found with the 
transgenic strains. 

10 Example 19 

CROSSBREEDING OF TRANSGENIC STRAINS WHICH EXPRESS HSA IN 
MILK RESULTS IN DOUBLE TRANSGENICS WHICH EXPRESS HIGHER 
LEVELS OF HSA THAN EITHER PARENT STRAIN 

15 

A crossbreeding was performed between transgenic strain #23 
(generated from vector p598, expresses 2.5 mg/ml of HSA in milk) and strain 
#69 (generated from vector p652, expresses 1.5 mg/ml of HSA in milk) 
resulting in a double transgenic strain, #1000, which possesses both 
20 transgenes. Quantitation by dot blot immunoanalysis. as previously described, 
demonstrated that strain #1000 females produce milk with HSA at a 
concentration of 4-5 mg/ml. 

Example 2Q 

25 

CULTURE AND ANALYSIS OF MAMMARY EXPLANTS 

Explants were prepared from the abdominal and thoracic mammary 
glands essentially as described by Topper et aL (Meth. Enzymol 39, 443-454, 

30 1975), with the following modifications. Explants were maintained in medium 
199, pH 7.2, containing 25 mM HEPES, 2.2 g/liter NaHC0 3 , 200,000 lU/liter 
penicillin, 200 mg/Irter streptomycin and 10 mg/liter neomycin. Sixteen to 
twenty explants, 1-2 mm 3 in size, were incubated for the indicated time period 
on impregnated lens paper, floated in 35 mm culture dishes with 2 ml of 

35 medium containing the indicated concentrations of insulin, hydrocortisone and 
prolactin in various combinations. Media was changed daily. 
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Proteins were analyzed either by Western immunoblot analysis or 
metabolic labeling. For Western immunoblot analysis aliquots from the 
collected medium were precipitated with 1 0% TCA, and washed with ethanol 
and acetone. Pellets were dissolved in SDS-PAGE sample buffer and were 
5 fractionated on 7.5% SDS polyacrylamide gels. Proteins were either stained 
with Coomassie blue or transferred onto nitrocellulose filters in a Bio-Rad 
trans-blot cell (Bio Rad). Filters were blocked with TBS (20 mM Tris, pH 7.5, 
500 mM NaCI) containing 3% gelatin for 3 h at 37°C and subsequently reacted 
overnight with iodinated anti-HSA monoclonal antibodies at room temperature. 

1 0 After extensive washings with TBS containing 0.5% tween, the filters were 
exposed to Kodak XAR-5 film at -80°C. For BLG detection, membranes were 
reacted with anti-BLG antibody, then with anti-rabbit IgG, and extrAvidin- 
alkaline Phosphatase conjugate. The complex was stained with NBT/BCIP 
(Bio-Makor Nes Ziona Israel) according to supplier instructions. De novo 

15 synthesized proteins were metabolically labeled as follows. Thirty minutes 
before labeling, medium was changed to methionine-free DME medium. The 
explants were then incubated for 4.5 hours in 1 ml methionine-free medium 
containing 150 uCi/ml 35[S]-methionine. The collected explants were extracted 
with 50 mM Tris-HCI, pH 8.0. 5 mM NaCI, 0.5% Nonidet P-40 (NP-40), 3 ug/ml 

20 PMSF and 1% aprotinin. Following extraction, lysates were clarified for 10 min 
at 10,000 x g and total protein synthesis was determined as TCA insoluble 
radioactivity. Fractions (300-500 of explant lysates or medium containing 
0.5 x 10 6 or 0.25 x 10 6 cpm, respectively, were pre-absorbed on Protein A- 
Sepharose CL-4B beads (Pharmacia Fine Chemicals, Uppsala, Sweeden) for 

25 30 min. Immunoprecipitation was performed with rabbit anti-human serum 
albumin polyclonal antibody or rabbit anti-BLG antiserum (Nordic Immunology, 
Tilburg, The Netherlands) for 1-2 hour at 0°C, followed by binding to Protein A- 
Sepharose CL-4B beads for 30 min at 0°C. After extensive washing with 50 
mM Tris-HCI, pH 7.4, 500 mM NaCI, 5 mM EDTA, 0.5% NP-40 and 5% sucrose, 

30 the pellets were washed once with the same buffer without NP-40 and sucrose, 
suspended in SDS-PAGE sample buffer, electrophoresed on 7.5 % 
polyacrylamide gel and exposed to Kodak XAR-5 films, following flourography. 

Example 21 

35 



EXPRESSION OF HSA, BLG AND 0-CASEIN RNA AND PROTEIN IN THE 
MAMMARY GLANDS OF VIRGIN, PREGNANT AND LACTATING MICE 
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In order to determine the effects of different hormone combinations on 
the synthesis and secretion of HSA and BLG from mammary explants of virgin 
mice, cultured over time, de novo synthesized proteins were metabolically 
labeled immediately upon explant (dO) or on the 5th day of culture (d5). 
5 Following metabolic labeling, approximately half of the explants from each 
sample were collected and 2 ml of fresh medium 199 containing about 1000- 
fold excess of cold methionine chase was added to the remaining explants. 
Collected explants and media were processed and immunoprecipitated as 
previously described. The remaining explants were processed similarly after 

10 24 hours of chase (d1 and d6, respectively). Aliquots of media and lysates 
representing equal tissue weight were analyzed by SDS-PAGE and 
fluorography. The results are shown in Figure 16 with the top part showing the 
analysis of explants from transgenic strain #23 and the bottom, transgenic 
strain #35. Figure designations are: -, no hormones; I, insulin; P, prolactin; F, 

1 5 hydrocortisone. 

HSA was produced and secreted from mammary explants on day 0 in 
the presence or absence of hormones, while BLG was not produced under 
either condition immediately upon explant. In the presence of insulin and 

20 prolactin, explants cultured for 5 days continued to produce and secrete HSA. 
Hydrocortisone had a minor effect. The combination of insulin and prolactin, 
with or without hydrocortisone, appears to have had a stabilizing effect on 
expressed HSA since a higher proportion was observed after 24 hours of cold 
chase on day 6 than on day 1 . BLG production and secretion was observed 

25 only on day 5 of culture and only in the presence of insulin and prolactin. 

Hydrocortisone had a minor effect. We have also found that the production of 
p- and a1 -casein in BLG transgenics and control mice followed the hormonal 
control pattern of BLG in BLG transgenic strain #35 (data not shown). These 
milk proteins were also induced by a combination of insulin and prolactin. 

30 However, we found the production of caseins from explants of HSA transgenic 
strain #23 on day 0. 

Example 23 

35 CORRELATION BETWEEN LEVELS OF EXPRESSION OF HSA IN MILK AND 
LEVELS OF EXPRESSION AND SECRETION OF HSA FROM MAMMARY 
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EXPLANTS OF VIRGIN FEMALES AMONG DIFFERENT STRAINS OF 
TRANSGENIC MICE 

In order to determine whether we could use levels of expression of HSA 
5 from mammary explants of virgin transgenic females to predict whether specific 
strains of transgenics will ultimately express HSA in their milk, and to estimate 
that level, upon lactation, we investigated several explant culture parameters. 
In a series of experiments we determined the best time points, culture 
conditions and means of estimation of HSA production from explants, that will 

1 0 best reflect the ia^tt secretion of HSA in the milk of several transgenic strains. 
Figure 17 shows a Western immunoanalysis of HSA secreted into milk of a 
number of BLG/HSA hybrid gene transgenic mouse strains. Three ul of 
defatted diluted (1 :5) milk samples were analyzed on SDS PAGE. Proteins 
were blotted onto nitrocellulose membrane and reacted with iodinated anti- 

15 HSA monoclonal antibodies as previously described. Relative levels of HSA in 
each milk were estimated in relative densitometry scan units. Figure 18 shows 
the results of an explant experiment where mammary explants from various 
HSA transgenic strains were metabolically labeled either immediately upon 
explanation, or on day 5 in culture in M-199 medium containing insulin (5 

20 ug/ml) and prolactin (5 ug/ml). Explant lysates and media were pooled from 
duplicate plates which contained explants from 3-4 virgin animals. Equal 
amounts of either total TCA precipMable protein cpm from the medium or lysate 
(A), or amounts of medium or lysate representing equal tissue weight (B) were 
analyzed by immunoprecipitation with anti-HSA antibodies, SDS-PAGE, 

25 fluorography and densitometry scanning of autoradiographs as previously 
described. Levels of HSA were represented in relative densitometry scan 
units. 

The correlations (r values) between HSA secretion in the milk and levels 
30 of expression from explants are summarized in Table 3. From this table it can 
be seen that there is a relatively high correlation between HSA secretion to the 
milk and HSA synthesis and secretion in explants at days 0 or 5 of culture. 
Mammary explants from mouse strains which do not secrete HSA in their milk 
also do not produce and secrete HSA from their explants. Explants from mice 
35 which secrete high levels of HSA in their milk do produce the HSA in culture. 
While it appears that any of the conditions tested produce high correlations, it 
appears that the best parameters for this work is to culture explants from 
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transgenics for 5 days in the presence of insulin and prolactin (5 tig/ml each) 
and to measure and compare sss-labeled HSA levels secreted to the medium 
on the basis of equal amount of TCA precipitable protein. Figure 19 shows the 
graph comparing levels of expression under these conditions. 

5 

These data suggest that this type of explant expressional analysis can 
be used on a biopsy of a virgin transgenic farm animal mammary gland in 
order to forecast the potential of that animal to express the transgene protein, 
as well as to estimate the level of expression, in its milk later in its 
1 0 development, upon lactation. 
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TABLE 3 

Correlation (r values) between HSA secretion 
in milk and synthesis and secretion by 
5 mammary explants of virgin animals 1 



Parameter 1 


Parameter 2 


Day 3 


Basis for Analysis 


Correlation 4 
(rvalue) 


HSA in milk 


HSA in media 


0 


Equal cpm/lane 


0.79 


HSA in milk 


HSA in iysates 


0 . 


Equal cpm/lane 


0.83 


HSA in milk 


HSA in media 


5 


Equal cpm/lane 


0.93 


HSA in milk 


HSA in Iysates 


5 


Equal cpm/lane 


0.87 


HSA in milk 


HSA in media 


0 


Equal tissue 


0.87 








weight/lane 




HSA in milk 


HSA in iysates 


0 


Equal tissue 


0.90 








weight/lane 




HSA in milk 


HSA in media 


5 


Equal tissue 


0.92 








weight/lane 




HSA in milk 


HSA in Iysates 


5 


Equal tissue 


0.89 








weight/lane 





1 0 1 Transgenic strains #76, 81, 86, 92, 69, 83, 23 and 1000 were evaluated. 

^Expression of HSA expressed by explants as determined in either media or Iysates. 
3 Day in culture of explant upon analysis. 

^Correlations (r values) were determined using Sigmaplot software. 
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ExamnlP PA 

PRODUCTION OF TRANSGENIC GOATS 

5 A. Induction of Superovulation 

Embryos were recovered from Saanen goat does that have been 
induced to superovulate by treating them for 1 2 days with intravaginal sponges 
impregnated with 30 mg of fluorogestone acetate (Chrono-Gest, Intervet 

1 0 International B.V.-Boxmeer-Holland). From the evening of the 9th day after 
sponge insertion, 5 intramuscular injections of follicle stimulating hormone 
(FSH-P, Schering Corp.) were administered every 12 hours (5, 4, 3, 3 and 2 
mg FSH-P). On the evening of the 11th day when the last FSH injection was 
administered, the sponges were withdrawn. Does were checked for estrous 

15 every 4 to 12 hours, beginning 12 hours after sponge removal. They were 
mated by 2 bucks at 20 and 36 hours after sponge withdrawal. One to two cell 
eggs were collected about 62 hours after sponge removal. The recipients are 
similarly synchronized with intravaginal sponges. On the evening of the 1 1th 
day sponges are removed and the recipient does are injected with 500 units of 

20 PMSG (Intervet). 

B. Surgery 

Does were taken off food (36 hours) and water (12 hours) prior to 
25 surgery. Anesthesia is induced by intravenous injection of thiopentone sodium 
and maintained by mixtures of oxygen (1-2 liters/minute) and halothane (1-2%) 
(Halocarbon Laboratories, N. Augusta, SC.). The reproductive tract is exposed 
through mid-central incision and a glass catheter inserted into the oviduct 
through the fimbria. Five ml PBS containing 5% FCS were introduced into the 
30 uterine lumen through a blunted 1 8-gauge needle and forced through the 
uterotubal junction and along the oviduct. 

C. Microinjection 

35 DNA (1 -4 ug /ml) is injected into one pronucleus of 1 or 2 cell eggs 

placed in a chamber filled with ovum culture medium (Row Labs, Irving, 
Scotland) containing 20% FCS and covered with Flurinet 70 (Sigma). 
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Pronuclei are visualized using the Nikon Diaphet Inverted microscope 
equipped with Nomarski optics at x400. Eggs are microinjected essentially as 
described by Hogan et aL (Manipulating the mouse embryo- a laboratory 
manual, Cold Spring Harbor Laboratory, 1986). Surviving embryos were 
5 surgically transferred to the oviduct of recipient does using the Socorex 1 -5 uJ 
micropipette. Up to 10 embryos are transferred to each recipient. 



Fyamole 25 

10 

HSA EXPRESSION FROM EXPLANTS OF MAMMARY GLANDS OF AN 
ABORTED TRANSGENIC BABY GOAT AND A DEAD NEWBORN 
TRANSGENIC BABY GOAT 

1 5 Two female transgenic goats were produced. However, one (goat #1) 

was spontaneously, prematurely, aborted about 1-2 weeks before expected 
delivery and found approximately 24 hours after it had aborted. Upon finding, it 
was stored at 4°C. Laboratory tests indicated that the mother suffered from Q- 
fever (a disease caused by rickettsia that damages the placenta, but not the 

20 embryo, and leads to spontaneous abortion). Goat #2 was also 

spontaneously, prematurely, aborted about 1 week before expected delivery, 
lived for approximately 24 hours and then died for unknown reasons. It was 
also stored at 4°C. Its mother did not appear to be ill. A few hours after goat #1 
was found and the death of goat #2, a sample of their mammary tissue was 

25 removed into a small container filled with M-1 99 medium. Samples were cut 
into smaller pieces (explants) which were incubated on impregnated lens 
paper in 35 mm tissue culture flasks with 2 ml of medium in the absence or 
presence of 5 ug/ml, each, of insulin (I), hydrocortisone (F) and prolactin (P). 
Medium was replaced and collected every 24 hours and stored at -70°C. 

30 Aliquots (300 u,Q were lyophilized, solubilized in sample buffer and analyzed 
on SDS-PAGE gels. The proteins were either blotted onto nitrocellulose 
membrane for Western immunoassay (Figure 20A) or stained with Coomassie 
brilliant blue (Figure 20B). The nitrocellulose membrane was blocked with 3% 
gelatin in TBS buffer (10 mM Tris. 500 mM NaCI) and hybridized overnight with 

35 iodinated anti-HSA monoclonal antibodies in TBS buffer containing 1% gelatin 
and 0.2% Tween 20. The membrane was washed in the same buffer and 
exposed to XAR film at -70°C. 
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Figure 20 shows that an immunoreactive protein of the same size of 
purified HSA was secreted from explants of goat #1 during the first 24 hours of 
culture in the presence of IFP. By day 3 of incubation HSA was reduced to just 
5 at or below the level of detection. We had also found that mammary explants of 
BLG/HSA transgenic mice also expressed high levels of HSA during the first 
24 hours of culture but that this declined greatly with time in culture. We could 
not detect secretion of caseins from explants of goat #1, although |J- 
lactoglobutin appears to be expressed in the medium of explants from the first 
1 0 day of culture in the presence of IFP (Figure 20B). 

Mammary explants from goat #2 did not secrete detectable levels of 
HSA during the first day in culture or during subsequent days (Figure 20B). 
Traces of two other milk proteins could be detected, but may represent a non- 
15 specific hybridization due to the overloading of proteins. While p-lactoglobulin 
was detected in the media of explants of goat #1 from the first day of culture, p- 
lactoglobulin was not clearly detected until the 6th day of culture from explants 
of goat #2. 
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pppnftft nf Strains Useful in Practicing the Invention 

Deposits of biologically pure cultures of the following strains were made 
5 under the Budapest Treaty with the American Type Culture Collection, 1 2301 
Parklawn Drive, Rockville, Maryland, The accession numbers indicated were 
assigned after successful viability testing, and the requisite fees were paid. 

Access to said cultures will be available during pendency of the patent 
10 application to one determined by the Commissioner of the United States Patent 
and Trademark Office to be entitled thereto under 37 C.F.R §1 .1 4 and 35 
U.S.C. §122, or if and when such access is required by the Budapest Treaty. 
All restriction on availability of said cultures to the public will be irrevocably 
removed upon the granting of a patent based upon the application and said 
1 5 cultures will remain permanently available for a term of at least five years after 
the most recent request for the furnishing of samples and in any case for a 
period of at least 30 years after the date of the deposits. Should the cultures 
become nonviable or be inadvertently destroyed, they will be replaced with 
viable cuitures(s) of the same taxonomic description. 

20 

stmin/piasmid ATCC No. Deposit Date 

p652.2 68653 July 25.1991 

p696.9 68654 25-1991 

25 

One skilled in the art will readily appreciate the present invention is well 
adapted to carry out the objects and obtain the ends and advantages 
mentioned, as well as those inherent therein. The peptides, antibodies, 
methods, procedures and techniques described herein are presented as 
30 representative of the preferred embodiments, or intended to be exemplary and 
not intended as limitations on the scope of the present invention. Changes 
therein and other uses will occur to those of skill in the art which are 
encompassed within the spirit of the invention or defined by the scope of the 
appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hurvitz, David R 
Nathan, Margret 
Shani, Moshe 

(ii) TITLE OF INVENTION: Transgenic Protein Production 
(iii) NUMBER OF SEQUENCES: 1 



15 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Rhone-Poulenc Rorer, Inc. 

(B) STREET: 500 Virginia Ave., Bldg. 3A 

(C) CITY: Ft. Washington 

(D) STATE: Pennsylvania 
20 (E) COUNTRY: USA 

(F) ZIP: 19034 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME : Goodman , Ros anne 

(B) REGISTRATION NUMBER: 52,534 

(C) REFERENCE /DOCKET NUMBER: A0856-US 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: (215) 962-4130 

(B) TELEFAX: (215) 962-4107 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
50 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



60 



(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Minghetti, P P 
Ruffner, D E 
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Kuang, W.-J- 
Denhison, O E 
Hawkins, J W 
Beattie, W G 

5 Dugaiczyk, A 

(B) TITLE: MOLECULAR STRUCTURE OP THE HUMAN ALBUMIN GENE 

IS REVEALED BY NUCLEOTIDE SEQUENCE WITHIN Qll-22 
OF CHROMOSOME 4 

(C) JOURNAL: J. Biol. Chem. 
10 (D) VOLUME: 261 

(P) PAGES r 6747-6757 
(G) DATE: 1986 

(K) RELEVANT RESIDUES IN SEQ ID NO:l: FROM 1 TO 19002 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 





WMWiwXAAlv 




A 1 AA XvaAAA X 


AAAAxAACAx 


ft^nfti^ft ft ft ft^^ft 
AxAxAAAAxA 


XAAXAwAXAA 






ft ft fn^^^n^nft 
UAaAIAjX xAb 


AAx UxAuAUi 


x x wAxU I 1 AA 


ft ^^t^^^flrt^^ ft 
VjAviC ruAL 1 


AAlAWtl 1A1 


LI XoAAAAAX? 






XAAwAXAAAl 


TTCACTACTA 


ft^p^^ftf^ft ft^^^nft 
AX lAlAAv.XA 


AlAlvlu 1 AA 


LAAuAwii X A 


1VU> X IvwiL 




25 


ulnnAV> X Inn 






XAAXAAl Xl*X 


AwAX X Aw X X A 




240 

^ w w 


CACACACACA 


CACACACACA 


CACGAGATAT 


AAAAAAAAAA 


AAAATAAACA 


AACAAACAAA 


300 




CAAACAAACA 


AACAAACAAA 


CAAACAAACA 


AACAAACAAA 


CGAAAGATAT 


AAAAAAAAAA 


360 


30 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAGAAATGAA 


ATATAATATC 


ATATATATGA 


420 




TATATATATA 


TATAAATTCC 


TATAGACTCA 


ATTCCTATAG 


ACTCTATAGA 


CTAATTCTAT 


480 


35 


AGACTCCTAT 


AGACTCAATT 


CCTATAGACT 


CCTATAGACT 


CAATTCCTAT 


AGACTCAATA 


540 




CTACGTGTAT 


TCAGCCCTTT 


CCCAGGGACT 


TCTACAAGGA 


AAAAGCTAGA 


GTTGGTTACT 


600 


40 


GACTTCTAAT 
AGAAAGAATT 


AAATAATGCC 


TACAATTTCT 


AGGAAGTTAA 


AAGTTGACAT 


AATTTATCCA 
GTAGGTTTTT 


660 
720 
780 


ATTTTCTTAA 


CTTAGAATAG 




CTTTTCAGAT 




CTGGCTTTAG 


AAAAAATGCT 


TU'mTi'crr 


CAATGGAAAA 


TAGGCACACT 


TGTTTTATGT 


45 


CTGTTCATCT 


GTAGTCAGAA 


AGACAAGTCT 


GGTATTTCCT 


TTCAGGACTC 


CCTTGAGTCA 


840 




TTAAAAAAAA 


TCTTCCTATC 


TATCTATGTA 


TCTATCATCC 


ATCTAGCTTT 


GATTTTTTCC 


900 




TCTTCTGTGC 


TTTATTAGTT 


AATTAGTACC 


CATTTCTGAA 


GAAGAAATAA 


CATAAGATTA 


960 


50 


TAGAAAATAA 


TTTCTTTCAT 


TGTAAGACTG 


AATAGAAAAA 


Ai-mxrinx: 


ATTATAAGAC 


1020 




TGAGTAGAAA 


AAATAATACT 


TTGTTAGTCT 


CTGTGCCTCT 


ATGTGCCATG 


AGGAAATTTG 


1080 


55 


ACTACTGGTT 


TTGACTGACT 


GAGTTATTTA 


ATTAAGTAAA 


ATAACTGGCT 


TAGTACTAAT 


1140 




TATTGTTCTG 


TAGTATCAGA 


GAAAGTTGTT 


CTTCCTACTG 


GTTGAGCTCA 


GTAGTTCTTC 


1200 




ATATTCTGAG 


CAAAAGGGCA 


GAGGTAGGAT 


AGCTTTTCTG 


AGGTAGAGAT 


AAGAACCTTG 


1260 


60 


GGTAGGGAAG 


GAAGATTTAT 


GAAATATTTA 


AAAAATTATT 


CTTCCTTCGC 


TTTGxTITEA 


1320 
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GACATAATGT TAAATTTATT TTGAAATTTA AAGCAACATA AAAGAACATG TGATTTTTCT 
ACTTATTGAA AGAGAGAAAG GAAAAAAATA TGAAACAGGG ATGGAAAGAA TCCTATGCCT 
GGTGAAGGTC AAGGGTTCTC ATAACCTACA GAGAATTTGG GGTCAGCCTG TCCTATTGTA 
TATTATGGCA AAGATAATCA TCATCTCATT TGGGTCCATT TTCCTCTCCA TCTCTGCTTA 
ACTGAAGATC CCATGAGATA TACTCACACT GAATCTAAAT AGCCTATCTC AGGGC1TGAA 
TCACATGTGG GCCACAGCAG GAATGGGAAC ATGGAATTTC TAAGTCCTAT CTTACTTGTT 
ATTGTTGCTA TGTCTITITC TTAGTTTGCA TCTGAGGCAA CATCAGCTTT TTCAGACAGA 
ATCGCTTTGG AATAGTAAAA AAGACACAGA AGCCCTAAAA TATGTATGTA TGTATATGTG 
TGTGTGCATG CGTGAGTACT TGTGTCTAAA TTTTTCATTA TCTATAGGTA AAAGCACACT 
TGGAATTAGC AATAGATGCA ATTTGGGACT TAACTCTTTC AGTATCTCTT ATTTCTAAGC 
AAAGTATTTA GTTTCGTTAG TAATTACTAA ACACTGAGAA CTAAATTGCA AACACCAAGA 
ACTAAAATCT TCAAGTGGGA AATTACAGTT AAATACCATG GTAATCAATA AAAGGTACAA 
ATCGTTTAAA CTCTTATGTA AAA1TTGATA AGATGTTTOA CACAACTTTA ATACATTGAC 
AAGGTCTTGT GGAGAAAACA GTTCCAGATC GTAAATATAC ACAA66GATT TAGTCAAACA 
ATTOTTTGGC AAGAATATTA TGAATTTTGT AATCGGTTGG CAGCCAATGA AATACAAAGA 
TGAGTCTAGT TAATAATCTA CAATTATTGG TTAAAGAAGT ATATTAGTGC TAATTTCCCT 
CCGTTTGTCC TAGCTTTTCT CTTCTQTCAA CCCCACACGC CTTTGGCACA ATCAAGTCGG 
TAACGTTTAT TTCCCTTCTT TTTCTCTTTA GCTCGGCTTA TTCCAGGGGT GTGTTTCGTC 
GAGATGCACG TAAGAAATCC ATTTTTCTAT TGTTCAACTT TTATTCTATT TTCCCAGTAA 
AATAAAGTTT TAGTAAACTC TGCATCTTTA AAGAATTATT TTGGCATTTA TTTCTAAAAT 
GGCATAGTAT TTTGTATTTG TGAAGTCTTA CAAGGTTATC TTATTAATAA AATTCAAACA 
TCCTAGGTAA AAAAAAAAAA AGGTCAGAAT TCTTTAGTCA CTGTAATTTT CTTTTCCGCA 
CTAAGGAAAG TGCAAAGTAA CTTAGAGTGA CTGAAACTTC ACAGAATAGG GTTGAAGATT 
GAATTCATAA CTATCCCAAA GACCTATCCA TTGCACTATG CTTTATTTAA AAACCACAAA 
ACCTGTGCTG TTGATCTCAT AAATAGAACT TGTATTTATA TTTATTTTCA TTTTAGTCTO 
TCTOCTTGGT TCCTGTrGAT AGACACTAAA AGAGTATTAG ATATTATCTA AGTTTGAATA 
TAAGGCTATA AATATTTAAT AATTTTTAAA ATAGTATTCT TGGTAATTGA ATTATTCTTC 
TGOTTAAAGG CAGAAGAAAT AATTGAACAT CATCCTCACT TTTTCTGTAG GAATCAGAGC 
CCAATATTTT GAAACAAATG CATAATCTAA GTCAAATGGA AAGAAATATA AAAAGTAACA 
TTATTACTTC TTGTTTTCTT CAGTATTTAA CAATCCTTTT TTTTCTTCCC TTCCCCAGAC 
AAGAGTGAGG TTGCTCATCG GTTTAAAGAT TTGGGAGAAG AAAATTTCAA AGCCTTCTAA 



1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 
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GTTAAAATAT TGATGAATCA AATTTAATGT TTCTAATAGT GTTGTTTATT ATTCTAAAGT 3240 

GCTTATATTT CCTTGTCATC AGGGTTCAGA TTCTAAAACA GTGCTGCCTC GTAGAGTTTT 3300 

CTGCGTTGAG GAAGATATTC TGTATCTGGG CTATCCAATA AGGTAGTCAC TGGTCACATG 3360 

GCTATTGAGT ACTTCAAATA TGACAAGTGC AACTGAGAAA CAAAAACTTA AATTGTATTT 3420 

AATTGTAGTT AATTTGAATG TATATAGTCA CATGTGGCTA ATGGCTACTG TATTGGACAG 3480 

TACAGCTCTG GAACTTGCTT GGTGGAAAGG ACTTTAATAT AGGTTTCCTT TGGTGGCTTA 3540 

CCCACTAAAT CTTCTTTACA TAGCAAGCAT TCCTGTGCTT AGTTGGGAAT ATTTAATTTT 3600 

I'mTm TT TAAGACAGGG TCTCGCTCTG TCGCCCAGGC TGGAGTGCAG TGGCGCAATC 3660 

TCGGCTCACT GCAAACTCCG CTCCCGGGTT CACGCCATTC TCCTGCCTCA GCCTCCCGAG 3720 

TAGCTGGGAC TACAGGCGCC CGCCATCACG CCCGGCTAAT CTTTTGTATT TTTAGTAGAG 3780 

ATCGGGTTTC ACCGTGTGCC AGGATGGTCT CAATCTCCTC ACATCGTGAT CTGCCCACCT 3840 

CGGCCTCCCA AAGTGCTGGG ATTACAGGAG TGAGTCACCG CGCCCGGCCT ATTTAAATGT 3900 

OTTTTAATCT AGTAAAAAAT GAGAAAATTG TTTTTTTAAA AGTOTACCTA ATCCTACAGG 3960 

CTAATTAAAG ACGTGTGTGG GGATCAGGTG CGGTGGTTCA CACCTGTAAT CCCAGCACTT 4020 

TGGAAGGCTG ATGCAGGAGG ATTGCTTGAG CCCAGGAGTA CAAGACCAGC CTGGGCAAGT 4080 

CTCTTTAAAA AAAACAAAAC AAACAAACAA AAAAATTAGG CATGGTGGCA CATGCCTGTA 4140 

GTCCTAGCTA CTTAGGAGGC TGACGTAGGA GGATCGTTTG GACCTGAGAG GTCAAGGCTA 4200 

CAGTGAGCCA TGATTGTGCC ACTGCACTCC AGCCTGGGTG ACAGAGTGAG ACTCTGTCTC 4260 

AAAAAAGAAA AAGGAAATCT GTGGGGTTTG TTTTAGTTTT AAGTAATTCT AAGGACTTTA 4320 

AAAATGCCTA GTCTTGACAA TTAGATCTAT TTGGCATACA ATTTGCTTGC TTAATCTATG 4380 

TGTGTGCATA GATCTACTGA CACACGCATA CATATAAACA TTAGGGAACT ACCATTCTCT 4440 

TTGCGTAGGA AGCCACATAT GCCTATCTAG GCCTCAGATC ATACCTGATA TGAATAGGCT 4500 

TTCTGGATAA TGCTGAAGAA GATGTATAAA AGATAGAACC TATACCCATA CATGATTTGT 4560 

TCTCTAGCGT AGCAACCTGT TACATATTAA AGTTTTATTA TACTACATTT TTCTACATCC 4620 

TTTGTTTCAG GGTGTTGATT GCCTTTGCTC AGTATCTTCA GCAGTGTCCA TTTGAAGATC 4680 

ATGTAAAATT AGTGAATGAA GTAACTGAAT TTGCAAAAAC ATGTGTTGCT GATGAGTCAG 4740 

CTGAAAATTG TGACAAATCA CTTGTAAGTA CATTCTAATT GTGGAGATTC TTTCTTCTGT 4800 

TTGAAGTAAT CCCAAGCATT TCAAAGGAAT TTTTTTTAAG TTTTCTCAAT TATTATTAAG 4860 

TGTCCTGATT TGTAAGAAAC ACTAAAAAGT TGCTCATAGA CTGATAAGCC ATTGTTTCTT 4920 

TTGTGATAGA GATGCTTTAG CTATGTCCAC AGTTTTAAAA TCATTTCTTT ATTGAGACCA 4980 
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AACACAACAfJ r TY" , A r IY2r , TV" ,r PIl rrrpm> * *rrv*»y>o 
nnwiwnnwui A i\j i/\ 11 1 AAA JL v»t»W 


AATTTGTCAT 


TTATAAACAC 


CTcrrrrrAA 


5040 




riAx x iuavaj i i i\?v9 1 1 iu 1 1 1TI\STAGAGG 


CTAATAGGGA 


TATGATAGCA 


TGTATTTATT 


5100 


5 


J -'» A a Arti i iv-l l Ai 1 1 1A 1 1 AT AGT AAG 


AACCCTTAAC 


ATGAGATCTA 


CCCTGTTATA 


5160 




* * ■*> J- Artf*v* ±vj iH^iUii^tiil 1 A 1 1 Va 1 I AAC 


TACGGGTACA 


CTGTTGTATA 


GCTTACTCAT 


5220 


10 


w l A\j\~ lUlAi iAAAAL X 1 I\5 TGCCCAxTGA 


TTAGTAACCC 


CTCGTTTCGT 


CCTCCCCCAG 


5280 


wwAw1\*w-wAA wwAGCA'ITAT AL l\.l"H\aAT 


TCTATGAGTT 


TGACTACTTT 


AGCTACCTTA 


5340 




XAXAnUlvAjl AllAlvvlALT GiiTATCTIl 


TTATGACTGA 


CTTATTTCCC 


TTAGCATAGT 


5400 


15 


<^>wAl IVAAAb i^-CAACCATG 1 l\al IViCCTA 


TTGCAGAATT 


TCCTTCTTTT 


CAAGGCTGAA 


5460 




TAATATTCCA GTGCATGTGT GTACCACATT 


TTCTTTATCC 


ATTAATTTGT 


TGATTGATAG 


5520 


20 


ACATTTAGGT TGGTTTTCTA CATCTTGACT 


ATCATGAATA 


GTGTTGCAAT 


GAACACAGGA 


5580 


GAGCTACTAT CTCTTAGAGA TGATATCATG 


GTTTTTATCA 


TCAGAAAACA 


CCCACTGATT 


5640 




TCTATGCTAA TTTTGTTACC TGGGTGGAAT 


AATAGTACAG 


CTATATATTC 


CTCATTTTAG 


5700 




ATATCTTTGT ATTTCTACAT ACAATAAAAA 


AGCAGAGTAC 


TTAGTCATGT 


TGAAGAACTT 


5760 




TAAACTTTTA GTATTTCCAG ATCAATCTTC 


AAAACAAGGA 


CAGGTTTATC 


TTlXritriCAC 


5820 


ou 


CACTCAATCT ATATATACCT CTTCTGGGCA 


AGGCCAGTTT 


TTATCACTCG 


AGCCTTTCCC 


5880 


Ci-i-rrrATTA TGTACCTCTC CCTCACAGCA 


GAGTCAGGAC 


TTTAACTTTA 


CACAATACTA 


5940 




l\afe>t~ruiAwi TATGAAATCT TAAAAATACA 


TAAAAATTAA 


TAAATTCTGT 


CTAGAGTAGT 


6000 


35 


AlAi 1 1 1\tV9\Mj 1 1 A\Jvi AllACl 1 rCA 


TAATAAAAAT 


TAGAGATAAG 


GAAAGGACTC 


6060 




AlliAirGGA AAGTGATTTT AGGTAACATT 


TCTGGAAGAA 


AAATGTCTAT 


ATCTTAATAG 


6120 


40 


l\*Aw 1 1 AA 1 A 1ATGATGGAT XVj\l\£TTACTC 


CTCAGTTTTC 


AATGGCATAT 


ACTAAAACAT 


6180 








CTCCCCTAGG 


6240 




wvav.L.ui\-iAA AAAbUviGGwi AATGAAATGA 


GAAACTCTCT 


GAATGTTTTT 




TGAATTCAPC? p FY2f , * r PrtPT' r P71/^ MArv*fnm»mfrm 
lvaAAA i.^»**ww 1\tC1oL1 IAw aagl;i 1A1 1 1 


TCTCTTGATT 


TCTGTTATAA 


TGATTGCTCT 


6300 


45 


TACCCTTTAG TTTTAAOTTT P7Afca.7AT7Artr , 5A 


GTCATATAAC 


TTTCCTTAAA 


GCTATTGACT 


6360 




GTCTTTTTGT CCTGTTTTAT TCACCATGAG 


'I'l^STM **IV"*OV * 
1 iAliUjl\jl\> 


ACAGTTAATT 


CTTATGAAAA 


6420 


50 


TTATATAGAG ATGGTTAAAT CATCAGAAAC 


lo 1 AAAL.L. 1\J 


GATTGGGAGG 


GGAAGCGGAT 


6480 


TTTTAAATCA TTTCCTGACC AAGCTTAACC 


AGTATATTAA 


ATCCTTTGTA 


CTGTTCTTTG 


6540 




GCTATAAAGA AAAAAGGTAC TGTCCAGCAA 


CTGAAACCTG 


CTTTCTTCCA 


TTTAGCATAC 


6600 


55 


CCTTTTTGGA GACAAATTAT GCACAGTTGC 


AACTCTTCGT 


GAAACCTATG 


GTGAAATGGC 


6660 




TGACTGCTGT GCAAAACAAG AACCTGAGAG 


AAATGAATGC 


TTCTTGCAAC 


ACAAAGATGA 


6720 


60 


CAACCCAAAC CTCCCCCGAT TGGTGAGACC 


AGAGGTTGAT 


GTGATGTGCA 


CTGCTTTTCA 


6780 


TGACAATGAA GAGACATTTT TGAAAAAGTA 


AGTAATCAGA 


TGTTTATAGT 


TGAAAATTAA 


6840 
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AAAGCATGGA GTAACTCCAT AGGCCAACAC TCTATAAAAA TTACCATAAC AAAAATAOTT 6900 

TCAACATTAA GACTTGGAAG TTTTGTTATG ATGATTTTTT AAAGAAGTAG TATTTGATAC 6960 

5 

CACAAAATTC TACACAGCAA AAAATATGAT CAAAGATATT TTGAAGTTTA TTGAAACAGG 7020 

ATACAATCTT TCTGAAAAAT TTAAGATAGA CAAATTATTT AATGTATTAC GAAGATATGT 7080 

10 ATATATGGTT GTTATAATTG ATTTCGTTTT AGTCAGCAAC ATTATATTGC CAAAATTTAA 7140 

CCATTTATGC ACACACACAC ACACACACAC ACACTTAACC CTTTTTTCCA CATACTTAAA 7200 

GAATGACAGA GACAAGACCA TCATGTGCAA ATTGAGCTTA ATTGGTTAAT TAGATATCTT 7260 

15 

TGGAATTTGG AGGTTCTGGG GAGAATGTCG ATTACAATTA TTTCTGTAAT ATTGTCTGCT 7320 

ATAGAAAAGT GACTGTTTTT CTTTTTCAAA ATTTAGATAC TTATATGAAA TTGCCAGAAG 7380 

20 ACATCCTTAC TTTTATGCCC CGGAACTCCT TTTCTTTGCT AAAAGGTATA AAGCTGCTTT 7440 

TACAGAATGT TCCCAAGCTG CTGATAAAGC TGCCTGCCTG TTGCCAAAGG TATTATGCAA 7500 

AAGAATAGAA AAAAAGAGTT CATTATCCAA CCTGATTTTG TCCATTTTGT GGCTAGATTT 7560 

25 

AGGGAACCTG AGTGTCTGAT ACAAACTTTC CGACATGGTC AAAAAAGCCT TCCTTTTATC 7620 

TGTCTTGAAA ATCTTTCATC TTTGAAGGCC TACACTCTCG lUn^TTCTT T TAAGATTTGC 7680 

30 CAATGATGAT CTGTCAGAGG TAATCACTGT GCATGTGTTT AAAGATTTCA CCACTTTTTA 7740 

TGGTGGTGAT CACTATAGTG AAATACTGAA ACTTGTTTGT CAAATTGCAC AGCAAGGGGA 7800 

CACAGTTCTT GTTTATCTTT TCATGATAAT TTTTAGTAGG GAGGGAATTC AAAGTAGAGA 7860 

35 

ATTTTACTGC ATCTAGATGC CTGAGTTCAT GCATTCATTC CATAAATATA TATTATGGAA 7920 

TGCTTTATTT TCTTTTCTGA GGAGTTTACT GATGTTGGTG GAGGAGAGAC TGAAATGAAT 7980 

40 TATACACAAA ATTTAAAAAT TAGCAAAATT GCAGCCCCTG GGATATTAGC GTACTCTTTC 8040 

TCTGACTTTT CTCCCACTTT TAAGGCTCTT TTTCCTGGCA ATGTTTCCAG TTGGTTTCTA 8100 

ACTACATAGG GAATTCCGCT GTGACCAGAA TGATCGAATG ATCTTTCCTT TTCTTAGAGA 8160 

45 

GCAAAATCAT TATTCGCTAA AGGGAGTACT TGGGAATTTA GGCATAAATT ATGCCTTCAA 8220 

AATTTAATTT GGCACAGTCT CATCTGAGCT TATGGAGGGG TGTTTCATGT AGAATTTTTC 8280 

50 TTCTAATTTT CATCAAATTA TTCCTTTTTG TAGCTCGATG AACTTCGGGA TGAAGGGAAG 8340 

GCTTCGTCTG CCAAACAGAG ACTCAAGTCT GCCAGTCTCC AAAAATTTGG AGAAAGAGCT 8400 

TTCAAAGCAT GGTAAATACT TTTAAACATA GTTGGCATCT TTATAACGAT GTAAATGATA 8460 

55 

ATGCTTCAGT GACAAATTGT ACATTTTTAT GTATTTTGCA AAGTGCTGTC AAATACATTT 8520 

CTTTGGTTCT CTAACAGGTA GAACTCTAAT AGAGGTAAAA ATCAGAATAT CAATGACAAT 8580 

60 TTGACATTAT TTTTAATCTT TTCTTTTCTA AATAGTTGAA TAATTTAGAG GACGCTGTCC 8640 
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TTTTTGTCCT AAAAAAAGGG apagatattt 


AAV? lit 1A1 1 


TATTTATAAA 


ATC1TGGACT 


8700 




CTTATTCTAA TGGTTCATTA tttttatapa 

**** * x a X J. >v X X X 1 X A 1 Aun 


Lrt-1 Vj 1 ALA* vA 


TGGTTCTTTA 


TTTAATTTTT 


8760 


5 


TAAAGTTATT TTTAATTTTT GTYSG AT AP AG 

* * XrtrtA X X X X VI 1 UVj/il AWM 


A ftT A GGT A T< A 


CATATTTACG 


GGGTATATGA 


8820 




GATATTTTGA TATAAGTATA PAAPatatat 


AAlVLVx 1 1A 


TTTAATTTTA 


TC1TCCCCCC 


8880 


10 


AATGATCTAA AACTATTTGP r P r TV* r Pe*'f*i"i»i«i' 


ATGTCU7TATA 


GTTAAATTCA 


GTCACCAACT 


8940 


AAGTrGAAGT TirTTTTTRT T*T»T*tv«r»»m»^ 


CTCCAGCTCT 


GATCTTCATC 


TCAMTm 1 


9000 




GPP TG AfiPPT* riY^TTTTVlini 71 mrp> Mnm Am 


TGGTTCTGGG 


AGCATACTTT 


AATAGCCGAG 


9060 


15 


T*PAAGAAAAA 'PA ^TiTS /T^TV^O r^rnnvi^/i^ 

iwAhwuuiAA l AL l AvivTGC C CCGTC ACCC 


ACACTCCTCA 


CCTGCTAGTC 


AACAGCAAAT 


9120 




WiKLALAALA WjAAA IAAAA TGAAAATAAT 


AGACATTATG 


CATGCTCTCT 


AGAAACTGTC 


9180 


20 


***» A X\jf*LrtA»» X\j 1 Al 1 lbv» iwi 1 V A 1 xCCT AC 


CATCTACACC 


ACCAAAATCA 


ACCAAATTTA 


9240 


loAAMAAAAA AVALjUvJCCAA CATAAAATTA 


TACACAGATA 


AACAGGCTAT 


GATTGGTTTT 


9300 




^5^*^^ A Iv 7\ ^ 7i 7A f % ^^^n^ntv%% m _i _in_i. ■ j 

vjooAAAUAAta IvACCi itAC CTGATTTAGG 


CAACTGTGAA 


ATGACTAGAG 


AATGAAGAAA 


9360 


25 


AiiAbiM.bli 1 ACATC TTGT CATAGAG1 1 1 


GAAGATAGTG 


CTGGATCTTT 


CTTTTTATAA 


9420 




\j lhjujjH. ivaa i AAAAAvJTCC CTCATTCTGT 


AGAAGTTATG 


ATTTCTTTTC 


TAAGAGACCT 


9480 


30 


x lAUAhu Iwi oAftAAAAlvi i GT^TCAATTG 


AGAAAAAAGA 


TAACTGGAGT 


TTGTGTAGTA 


9540 


u x i\.UUUjA1 lAi AAAA I VV X"l"l"iV* 1'ATGT 


ATTATCTAAT 


TTAATCCTCA 


AAACTTCTTC 


9600 




***** * **wwii ullvjlwilviA VAL. XU^AXaAo 


GCTGAAGCTC 


AGAGACGCTG 


AGCCCTCTGC 


9660 


35 


TAAC AAGTPP T APTYTPT* A A P a fc r»nv"* * * * 
x nrturt/wj x v~v- x AU iw 1 AAV AAJL? IviAl AAA 


GCCAGAGCTG 


GAAGTCACAT 


CTGGACTCCA 


9720 




AAPPTY? ATYJP TTV*TPA^2PP'P r**VTY*r*r*rT*rrT\ 
nAwwiuftiVA, x XVXVAVsVVl \aL IvjCCCLI 1 


TTAGAGTTCC 


TTTTTAATTT 


CTGCTTTTAT 


9780 


40 


GACTTGP/PAG ATTTP'FAPP'P 7v o/"* tv r> t\ r"» t\ * 
A x w\* x nw a x x xv x AVyACACAGA 


CTCTTAAATG 


GATAATTCTG 


CCCTAAGGAT 


9840 


AAGTGAT'PAP P ATTTY.IP'F'TY"* A^5\7A/^rn7\/i> * 


CTAATGAATT 


TTAAAAATTA 


TTTCTGTATG 


9900 




TCCATTTK3A AT I * ! 'ITTT A T PAr^Ti 7\rp 
* > " w *» A * * * axxxxvxxax uAoAAA 1 Alii 


ATTTGCCTAG 


TGTTTTCATA 


TAAAATATCG 


9960 


45 


CATGATAATA CCATTTTGAT TGGPGA r r r n r r r n 


vi'x'i'iTAGGG 


CAGTAGCTCG 


CCTGAGCCAG 


10020 




AGATTTCCCA AAGCTGAGTT TGCAGAAGTT 


TVPA APTTiTir"* 


TGACAGATCT 


TACCAAAGTC 


10080 


50 


CACACGGAAT GCTGCCATGG AGATCTGCTT 


GAATGTGPTG 


A?TY2 AP Af2f2P*T» 


AAAGAGTCGT 


10140 




twAiAibCi x i l ivGTAGCT TGCATGCTCA 


AGTTGGTAGA 


ATGGATGCGT 


TTGGTATCAT 


10200 




TGGTGATAGC TGACAGTGGG TTGAGATTGT 


CTTCTGTGCT 


TTCGTCTGTC 


CTATCTTCAA 


10260 


55 


TCTTTCCCTG CCTATGGTGG TGGTACCTTT 


CTGTTTTTAA 


CCTGCTATAA 


ATTACCAGAT 


10320 




AAACCCATTC ACTGATTTGT AACTCCTTTC 


AGTCATGCTC 


TAACTGTAAA 


TGAAGGCTTA 


10380 


60 


AACTGAAGTA GAACAGTTAC AAGGTTTTAC 


TTGGCAGAAC 


ATCTTGCAAG 


GTAGATGTCT 


10440 


AAGAAGATTT riTlTlvTIT TTTTAAGACA 


GAGTTTCGCT 


CTTCTTTCCC 


AGGCTGGGGT 


10500 
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GCAATGGTGT GATCTTGGCT CAGCGCAACC TCTGCCTCCT GGGTTCAAGT GATTTTCATG 10560 

CCTCAGCCTC CCAAGTAGCT GGGATTACAG GCATGCGCCA CCACACCTGG CTAATTTTGT 10620 

ATTTTTAGTA GAGGCGGGGT TTCACCATAT TGTCCAGACT GGTCTCGAAC TCCTGACCTC 10680 

AGGTGATCCA CCCGCCTTGG CCTCCCAAAG TGCTGGGATT ACAGGCATGA GCCACCTTGC 10740 

CCAGCCTAAG AAGATTTTTT GAGGGAGGTA GGTGGACTTG GAGAAGGTCA CTACTTGAAG 10800 

AGATTTTTGG AAATGATGTA TTTTTCTTCT CTATATTCCT TCCCTTAATT AACTCTGTTT 10860 

GTTAGATGTG CAAATATTTG GAATGATATC TCTTTTCTCA AAACTTATAA TATTTTCTTT 10920 

CTCCCTTTCT TCAAGATTAA ACTTATGGGC AAATACTAGA ATCCTAATCT CTCATGGCAC 10980 

TTTCTGGAAA ATTTAAGGCG GTTATTTTAT ATATGTAAGC AGGGCCTATG ACTATGATCT 11040 

TGACTCATTT TTCAAAAATC TTCTATATTT TATTTAGTTA TTTGGTTTCA AAAGGCCTGC 11100 

ACTTAATTTT GGGGGATTAT TTGGAAAAAC AGCATTGAGT TTTAATGAAA AAAACTTAAA 11160 

TGCCCTAACA GTAGAAACAT AAAATTAATA AATAACTGAG CTGAGCACCT GCTACTGATT 11220 

AGTCTATTTT AATTAAGTGG GAATGTTTTT GTAGTCCTAT CTACATCTCC AGGTTTAGGA 11280 

GCAAACAGAG TATGTTCATA GAAGGAATAT GTGTATGGTC TTAGAATACA ATGAACATGT 11340 

TCTGCCAACT TAATAAAGGT CTGAGGAGAA AGTGTAGCAA TGTCAATTCG TGTTGAACAA 11400 

TTTCCACCAA CTTACTTATA GGCGGACCTT GCCAAGTATA TCTGTGAAAA TCAAGATTCG 11460 

ATCTCCAGTA AACTGAAGGA ATGCTGTGAA AAACCTCTGT TGGAAAAATC CCACTGCATT 11520 

GCCGAAGTGG AAAATGATGA GATGCCTGCT GACTTGCCTT CATTAGCTGC TGATTTTGTT 11580 

GAAAGTAAGG ATGTTTGCAA AAACTATGCT GAGGCAAAGG ATGTCTTCCT GGGCATGTAA 11640 

GTAGATAAGA AATTATTCTT TTATAGCTTT GGCATGACCT CACAACTTAG GAGGATAGCC 11700 

TAGGCTTTTC TGTGGAGTTG CTACAATTTC CCTGCTGCCC AGAATGTTTC TTCATCCTTC 11760 

CCTTTCCCAG GCTTTAACAA TTITTGAAAT AGTTAATTAG TTGAATACAT TGTCATAAAA 11820 

TAATACATGT TCACGGCAAA GCTCAACATT CCTTACTCCT TAGGGGTATT TCTGAAAATA 11880 

CGTCTAGAAA CATTTTGTGT ATATATAAAT TATGTATACT TCAGTCATTC ATTCCAAGTG 11940 

TATTTCTTGA ACATCTATAA TATATGTGTG TGACTATGTA TTGCCTGTCT ATCTAACTAA 12000 

TCTAATCTAA TCTAGTCTAT CTATCTAATC TATGCAATGA TAGCAAAGAA GTATAAAAAG 12060 

AAATATAGAG TCTGACACAG GTGCTTTATA TTTGGTGAAA AGACCAGAAG TTCAGTATAA 12120 

TGGCAATATG GTAGGCAACT CAATTACAAA ATAAATGTTT ACGTATTGTC AGAAGTTGTG 12180 

GTGATAAACT GCATTTTTGT TGTTGGATTA TGATAATCCA CTAAATAATA TTTCCTAAAA 12240 

TTATGTACCC TACAAGATTT CACTCATACA GAGAAGAAAG AGAATATTTT AAGAACATAT 12300 
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CTCTGCCCAT CTATTTATCA GAATCCTTTT 


GAGATGTAGT 


TTAAATCAAA 


CAAAATGTTA 


12360 




ATAAAAATAA CAAGTATCAT TCATCAAAGA 


CTTCATATGT 


GCCAAGCAGT 


GTGTGCTTTG 


12420 


5 


rGTAGATTAT GTCATATAGT TCTCATAATC 


CACCTTCCGA 


GACAGATACT 


ATTTATTTTT 


12480 




rGAGACAGAG ttttactctt gttgcccagg 


CTGGAGTGCA 


ATGGTGCCAT 


CTCGGCTCAC 


12540 


10 


wACAACCTTC GCCTCCCAGG TTCAAGCGAT 


TCTCCTGCCT 


CAGCCTCCTG 


GGATTACAGG 


. 12600 


CATGCACCAC CATGCCTGGC TAATTTTGTA 


TTTTTAGTAG 


AGATGGGGTT 


TCACCATGTT 


12660 




GGTCAGACTG GTCTCAAACT CCTGACCTCT 


GGTGATATGC 


CTGCCTCAGC 


CTCCTAAAGT 


12720 


ID 


GCTGGGATTA CAGGCATGAG CCACTGTGCC 


CAGCCGACAG 


ATACTATTAT 


TATTTCCATT 


12780 




CTACCGAGAA GGAGACTAAG GCTCTGATCA 


TTTAAATAAG 


TTGCCTAAGG 


TGATGCAGTC 


12840 


20 


ATATAAGTAG CAGAGCTAGG AATTGAGCCT 


TGGTAACTTT 


AACTCTGGAC 


CCCAAGTCCT 


12900 




TAGCTACTAA GCTTTACTGC ATGGGGTTTA 


GTCAAATTAA 


GACTTTTGGA 


ATATGAGTTA 


12960 




CTTTTGAtiAT TAGCTTTGTG ATATTTTTTC 


TGCTCATTTG 


TCCAACAAAG 


TCTATTTTAT 


13020 


do 


TTTCATCTTA ATTAGGTTTT TGTATGAATA 


TGCAAGAAGG 


CATCCTGATT 


ACTCTGTCGT 


13080 




GCTGCTGCTG AGACTTGCCA AGACATATGA 


AACCACTCTA 


GAGAAGTGCT 


GTGCCGCTGC 


13140 


30 


AGATCCTCAT GAATGCTATG CCAAAGTGGT 


AGGTTTATTG 


TTGGAAAAAA 


ATGTAGTTCT 


13200 




TTGACTGATG ATTCCAATAA TGAGAAAGAA 


AAATAATGCA 


AGAATGTAAA 


ATGATATACA 


13260 




GTGCAATTTA GATCTTTTCT TGAGATGGTT 


TCAATTCTGG 


AATCTTAAAC 


ATGAAAGAAA 


13320 




AAGTAGCCTT AGAATGATTA ACAAAATTTA 


GACTaGTTAG 


AATAGAAAGA 


TCTGAATAGA 


13380 




GCAATCTCTA AAAAATTTTG ATCTTTTTTT 


CTCTTTTTCA 


CAATCCTGAG 


AACAAAAAAA 


13440 


40 


AATTAAATTT AAATGTTAAT TAGAAGATAT 


TTAACTTAGA 


TGTAAAGTGA 


GTTAACCTGA 


13500 




TTCCAGGATT AATCAAGTAC TAGAATTAGT 


ATCTTATGGC 


AAATTATAGA 


ACCTATCCCT 


13560 




TTAGAATATT TTCAAATCTT TTTGAGGATG 


TTTAGGAATA 


GTTTTACAAG 


AAATTAAGTT 


13620 


45 


*vj^m\ja\*\jJ\a A'lTJU.vi'ivi^ GAGGATTTTT 


AGGGTTCCCA 


CTAGCATATC 


TAATGGTTTC 


13680 




lVjrt/itiAJ. AKjAATCAGAG AAAACTCATT 


TTTCCTGCTT 


TCAAGAAGCT 


ACTGTATGCC 


13740 


50 


Au^aUAULii i\a W\UAAAt-AAT G AC C AACGT A 


AAATCTCTCA 


TTTTGGAGAG 


CCTGGAATCT 


13800 




AACTGGAAAG GTGAACTAAT AATAATAATA 


TGTACAATCA 


TAGCCATCAT 


TTATTAAACT 


13860 




TTTATTATAT GCAAGGCACT GTTTAATTTC 


ATTAGCTTAC 


CTGGTTTACA 


GAGCAGCTCT 


13920 


55 


ATGAGATGAG TGCCATCTTT GCCCCTATTT 


TAGGGATAAG 


GATTCCGAAA 


TGTGGAGATG 


13980 




GTAAGTAAAA TTGCACAACT GAAGAATGAG 


TTACATGACT 


TGGCTCAAAT 


ACTGGTCATT 


14040 


60 


GAACTCCAGA GCCTGAATAT TCTTAACCAC 


TTACATGATG 


CAAGCTCACC 


AAATAAATAG 


14100 




TTCGAATGTA TTGTCACAGA GCGGCATTGA 


TATTCATCTA 


TTCATGTGGC 


TTTGAGTAGG 


14160 
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AAGAAGAAAG GATATCATTC TGACCAGAGG GGTGAAAAAC AACCTGCATC TGATCCTGAG 14220 

GCATAATACT ATTAACACAA TTCTTTTATG TTTCAGTTCG ATGAATTTAA ACCTCTTGTG 14280 

GAAGAGCCTC AGAATTTAAT CAAACAAAAT TGTGAGCTTT TTGAGCAGCT TGGAGAGTAC 14340 

AAATTCCAGA ATGCGTAAGT AATTTTTATT GACTGATTTT TTTTATCAAT TTGTAATTAT 14400 

TTAAGACTTA ATATATGAGC CACCTAGCAT AGAACTTTTA AGAATGAAAA TACATTGCAT 14460 

ATTTCTAATC ACTCTTTGTC AAGAAAGATA GGAGAGGAGA GATAAAATAG TTGATGGGGT 14520 

GGAGAGGTCT ATATTTGAAT GTAGTCTAAA AATTGTTCTC TTAAGATTGG AAGTATCTAG 14580 

GCTGGGAGGG TAAATACCAA ATCTTGGTAT ATCAGAACTG AGCATGTCCC TTGAAGGTTA 14640 

AGAAATAGTT AATGGGCAAA TAGAGCATGG CAATATTTTG TAGAGCAGCA AGTAGTAGGC 14700 

CTTGAATAGA TGTCGCTCAA AAAGTAATAT GTAAGCTGAA CACAAAAATG TAACAAATGA 14760 

ATTTAGATAC ATATTTGAAT ATTAAATTCA GGTTGTTTGG GAGATGCACC TAGTCTTTGA 14820 

TGGTTAAACC TTTCCCTCCA TAGAAGAGAC AGAGACAGAA TGGCTTGCTG GACTAATGTC 14880 

CCAATTCAAT AGAGTCTTAT CTACGAAGGT TAAAAACAAG AAGAGACATA TTATACAGTA 14940 

GATATTTATT GTGTGGCTCA TACACATGGT GCTCTTCTGA TTATGGATTT TAGAGATAAT 15000 

AACAGTGAAC AAGACATAGT TTCTTTCCTC GAGTAGATTA AAGTCATACA TTGACTTTTA 15060 

ATGGTGACTG GCATTCTTAA TACATGATTA TTATATATTA GGTACCATGT CAGATTAATT 15120 

ATAATACTTT ACTATTTTTA ATTTAACCCT TGAACTATCC CTATTGAGTC AGATATATTT 15180 

CCTTCCATTT TCTACTTGTA TCTTTCAAGT TTAGCATATG CTGATACATA TGAAGCTCTC 15240 

TCCAGGTTTT ATTGAAAGAA GAAATTAATA AATTTATTAA TGTCACTGAA TTAGGCAACT 15300 

CACTTTCCCA AGATTATGCA AGTGGTACAG GTGGAACTCA AAGCCAAGTT TAACTAGTTG 15360 

TTCAGGAGAA TGTTTTCTAC CCTCCACTAA CCCACTACTC TGCAGATGGA GATAATATGA 15420 

TGAATGGAAC ATAGCAACAT CTTAGTTGAT TCCGGCCAAG TCTTCTCTGT TTTATCTACT 15480 

ATGTTAGACA GTTTCTTGCC TTGCTGAAAA CACATGACTT CTTTTTTTCA GGCTATTAGT 15540 

TCGTTACACC AAGAAAGTAC CCCAAGTGTC AACTCCAACT CTTGTAGAGG TCTCAAGAAA 15600 

CCTAGGAAAA GTGGGCAGCA AATGTTGTAA ACATCCTGAA GCAAAAAGAA TGCCCTGTGC 15660 

AGAAGACTAT GTGAGTCTTT AAAAAAATAT AATAAATTAA TAATGAAAAA ATTTTACCTT 15720 

TAGATATTGA TAATGCTAGC TTTCATAAGC AGAAGGAAGT AATGTGTGTG TGTGCATGTT 15780 

TGTGTGCATG TGTGTGTGCA TGCACGTGTG TGTATGTGTG ATATTCGCAG TCAAGGCCCC 15840 

GAGGATGATA ATTTTTTTTT TTTTTTTGAG ACGGAGTCTC GCTTTGTTGT CCAGGCTGGA 15900 

GTGCAGTGGT GCCATCTCGG CTCACTGCAA CCTCCGCCTC CCAAGTTCAA GCCATTCTCC 15960 
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TGCCTCAGCC TCCCAAGTAG CTGGGACTAC AGGTGCATGC CACCATGCCT GGCTAATTTT 16020 

TTGTATTTTT AGTAGAAAAT TTTCAGCTTC ACCTCTTTTG AATTTCTGCT CTCCTGCCTG 16080 

TTCTTTAGCT ATCCGTGGTC CTGAACCAGT TATGTGTGTT GCATGAGAAA ACGCCAGTAA 16140 

GTGACAGAGT CACCAAATGC TGCACAGAAT CCTTGGTGAA CAGGCGACCA TGCTTTTCAG 16200 

CTCTGGAAGT CGATGAAACA TACGTTCCCA AAGAGTTTAA TGCTGAAACA TTCACCTTCC . 16260 

ATGCAGATAT ATGCACACTT TCTGAGAAGG AGAGACAAAT CAAGAAACAA ACGTGAGGAG 16320 

TATTTCATTA CTGCATGTGT TTGTAGTCTT GATAGCAAGA ACTGTCAATT CAAGCTAGCA 16380 

ACTTTTTCCT GAAGTAGTGA TTATATTTCT TAGAGGAAAG TATTGGAGTG TTGCCCTTAT 16440 

TATGCTGATA AGAGTACCCA GAATAAAATG AATAACTTTT TAAAGACAAA ATCCTCTGTT 16500 

ATAATATTGC TAAAATTATT CAGAGTAATA TTGTGGATTA AAGCCACAAT AGAATAACAT 16560 

GTTAGACCAT ATTCAGTAGA AAAAGATGAA CAATTAACTG ATAAATTTGT GCACATGGCA 16620 

AATTAGTTAA TGGGAACCAT AGGAGAATTT ATTTCTAGAT GTAAATAATT ATTTTAAGTT 16680 

TGCCCTATGG TGGCCCCACA CATGAGACAA ACCCCCAAGA TGTGACTTTT GAGAATGAGA 16740 

CTTGGATAAA AAACATGTAG AAATGCAAGC CCTGAAGCTC AACTCCCTAT TGCTATCACA 16800 

GGGGTTATAA TTGCATAAAA TTTAGCTATA GAAAGTTGCT GTCATCTCTT GTGGGCTGTA 16860 

ATCATCGTCT AGGCTTAAGA GTAATATTCC AAAACCTGTC ATGCCCACAC AAATCTCTCC 16920 

CTGGCATTGT TGTCTTTGCA GATGTCAGTG AAAGAGAACC AGCAGCTCCC ATGAGTTT3G 16980 

ATAGCCTTAT TTTCTATAGC CTCCCCACTG AAGGGAGCAA AGTTTAAGAA CCAAATATAA 17040 

AGTTTCTCAT CTTTATAGAT GAGAAAAATT TTAAATAAAG TCCAAGATAA TTAAATTTTT 17100 

AAGGATCATT TTTAGCTCTT TAATAGCAAT AAAACTCAAT ATGACATAAT ATGGCACTTC 17160 

CAAAATCTGA ATAATATATA ATTGCAATGA CATACTTCTT TTCAGAGATT TACTGAAAAG 17220 

AAATTTGTTG ACACTACATA ACGTGATGAG TGGTTTATAC TGATTGTTTC AGTTGGTCTT 17280 

CCCACCAACT CCATGAAAGT GGATTTTATT ATCCTCATCA TGCAGATGAG AATATTGAGA 17340 

CTTATAGCGG TATGCCTGGC CCAAGTACTC AGAGTTGCCT GGCTCCAAGA TTTATAATCT 17400 

TAAATGATGG GACTACCATC CTTACTCTCT CCATTTTTCT ATACGTGAGT AATGTTTTTT 17460 

CTCTTTTTTT OTTTTCTTTT TCCATTCAAA CTCAGTGCAC TPGTTGAGCT CGTGAAACAC 17520 

AAGCCCAAGG CAACAAAAGA GCAACTGAAA GCTGTTATGG ATGATTTCGC AGCTTTTGTA 17580 

GAGAAGTGCT GCAAGGCTGA CGATAAGGAG ACCTGCTTTG CCGAGGAGGT ACTACAGTTC 17640 

TCTTCATTTT AATATGTCCA GTATTCATTT TTGCATGTTT GGTTAGGCTA GGGCTTAGGG 17700 

ATTTATATAT CAAAGGAGGC TTTCTACATG TGGGACAGGG ATCTTATTTT ACAAACAATT 17760 

GTCTTACAAA ATGAATAAAA CAGCACTTTG TTTTTATCTC CTGCTCTATT GTCCCATACT 17820 



SUBSTITUTE SHEET 



WO 93/03164 



102 



PCT/US92/06300 



GTTGAATGTT 


TATAATGCAT 


GTTCTGTTTC 


CAAATTTGTG 


ATGCTTATGA ATATTAATAG 


17880 


GAATATTTGT 


AAGGCCTGAA 


ATATTTTGAT 


CATGAAATCA 


AAACATTAAT TTATTTAAAC 


17940 


ATTTACTTGA 


AATGTGGTGG 


TTTGTGATTT 


AGTTGATTTT ATAGGCTAGT GGGAGAATTT 


18000 


ACATTCAAAT 


GTCTAAATCA 


CTTAAAATTT 


CCCTTTATGG 


CCTGACAGTA ACTTTTTTTT 


18060 


ATTCATTTGG 


GGACAACTAT 


GTCCGTGAGC 


TTCCATCCAG 


AGATTATAGT AGTAAA1TGT 


18120 


AATTAAAGGA 


TATGATGCAC 


GTGAAATCAC 


TTTGCAATCA 


TCAATAGCTT 


CATAAATCTT 


18180 


AATTTTGTAT 


CCTAATAGTA 


ATGCTAATAT 


TTTCCTAACA 


TCTGTCATGT 


CTTTGTGTTC 


18240 


AGG6TAAAAA 


ACTTGTTGCT 


GCAAGTCAAG 


CTGCCTTAGG 


CTTATAACAT 


CACATTTAAA 


18300 


AGCATCTCAG 


GTAACTATAT 


TTTGAATTTT 


TTAAAAAAGT 


AACTATAATA 


GTTATTATTA 


18360 


AAATAGCAAA 


GATTGACCAT 


TTCCAAGAGC 


CATATAGACC 


AGCACCGACC 


ACTATTCTAA 


18420 


ACTATTTATG 


TATGTAAATA 


TTAGCTTTTA 


AAATTCTCAA 


AATAGTTGCT 


GAGTTGGGAA 


18480 


CCACTATTAT 


TTCTATTTTG 


TAGATGAGAA 


AATGAAGATA AACATCAAAG 


CATAGATTAA 


18540 


GTAATTTTCC 


AAAGGGTCAA 


AATTCAAAAT 


TGAAACCAAG 


GTTTCAGTGT 


TGCCCATTGT 


18600 


CCTGTTCTGA 


CTTATATGAT 


GCGGTACACA 


GAGCCATCCA AGTAAGTGAT 


GGCTCAGCAG 


18660 


TGGAATACTC 


TGGGAATTAG 


GCTGAACCAC 


ATGAAAGAGT 


GCTTTATAGG 


GCAAAAACAG 


18720 


TTGAATATCA 


GTGATTTCAC 


ATGGTTCAAC 


CTAATAGTTC 


AACTCATCCT 


TTCCATTGGA 


18780 


GAATATGATG 


GATCTACCTT 


CTGTGAACTT 


TATAGTGAAG 


AATCTGCTAT 


TACATTTCCA 


18840 


ATTTGTCAAC 


ATGCTGAGCT 


TTAATAGGAC 


TTATCTTCTT 


ATGACAACAT 


TTATTGGTGT 


18900 


GTCCCCTTGC 


CTAGCCCAAC 


AGAAGAATTC 


AGCAGCCGTA 


AGTCTAGGAC 


AGGCTTAAAT 


18960 


TGTTTTCACT 


GGTGTAAATT 


GCAGAAAGAT 


GATCTAAGTA 


ATTTGGCATT 


TATTTTAATA 


19020 


GGTTTGAAAA 


ACACATGCCA 


TTTTACAAAT 


AAGACTTATA 


TTTGTCCTTT 


TGTTTTTCAG 


19080 


CCTACCATGA, 


GAATAAGAGA 


AAGAAAATGA 


AGATCAAAAG 


CTTATTCATC 


tgtttttctt 


19140 


TTTCGTTGGT 




CACCCTGTCT 


AAAAAACATA 


AArrrcrrrA 


ATCATTTTGC 


1 Q2ft ft 


CTCTTTTCTC 


TGTGCTTCAA 


TTAATAAAAA 


ATGGAAAGAA 


TCTAATAGAG 


TGGTACAGCA 


19260 


CTGTTATTTT 


TCAAAGATGT 


GTTGCTATCC 


TGAAAATTCT 


GTAGGTTCTG 


TGGAAGTTCC 


19320 


AGTGTTCTCT 


CTTATTCCAC 


TTCGGTAGAG 


GATTTCTAGT 


1TCTTGTGGG 


CTAATTAAAT 


19380 


AAATCATTAA 


TACTCTTCTA 


AGTTATGGAT 


TATAAACATT 


CAAAATAATA TTTTGACATT 


19440 


ATGATAATTC 


TGAATAAAAG 


AACAAAAACC 


ATGGTATAGG 


TAAGGAATAT AAAACATGGC 


19500 


TTTTACCTTA 


GAAAAAACAA 


TTCTAAAATT 


CATATGGAAT 


CAAAAAAGAG 


CCTGCAG 


19557 
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WHAT IS C1..AIMFD AS NEW AND IS DESIRED TO BE COVFRFD 1 1NDFR 
LETTERS PATENT IS : 

1 . A DN A construct comprising a 5' flanking sequence from a mammalian 
5 gene and a sequence coding for human serum albumin, wherein the human 

serum albumin sequence comprises at least one, but not all, of the introns in 
the naturally occurring gene encoding for the HSA protein. 

2. The construct of claim 1 wherein said gene is a milk protein gene. 

10 

3. The construct of claim 1 wherein said introns are selected to provide for 
expression of HSA in mammalian cells at levels equal to or greater than the 
naturally occurring HSA gene. 

15 4. The genetic construct of claim 3 wherein said Introns are selected from 
the group consisting of 1-6, 7-14, 1+7-14, 1+2 +12-14, 2 + 7-14 and 1+ 2 + 7- 
14. 

5. A DNA construct encoding HSA, comprising one but not all of the first 7 
20 introns of the HSA gene, and one of the last 7 introns of the HSA gene. 

6. A DNA construct encoding HSA comprising two contiguous exons 
encoding HSA and an HSA intron. 

25 7. A DNA construct comprising DNA sequences encoding human serum 
albumin operably linked to a mammary tissue specific promoter, said DNA 
construct expressed by the mammary glands of a lactating female transgenic 
mammal. 

30 8. A transgenic mammal having incorporated into its genome the DNA 
construct of claim 7. 

9. The transgenic mammal of claim B wherein the mammal is selected from 
the group consisting of mice, rabbits, sheep, goats, pigs and cattle. 

35 

10. The transgenic mammal of claim 8 wherein the promoter is the B- 
lactoglobulin protein promoter. 
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1 1 . The transgenic mammal of claim 8 wherein said mammal is a mouse 
having incorporated into its genome DNA construct encoding human serum 
albumin that is produced in the milk of a lactating mouse at levels of 

5 approximately 1 00 \ig/m\ of milk or greater. 

12. A method of making a transgenic mammal having incorporated into its 
genome a DNA construct encoding human serum albumin and a mammary 
tissue specific promoter, said DNA construct expressed by mammary glands of 

10 a lactating female transgenic mammal comprising providing a DNA construct 
containing the B-Iactoglobulin promoter operably linked with a nucleotide 
sequence encoding human serum albumin. 

13. The method of claim 12 further comprising microinjecting the DNA 

1 5 construct into the embryo of a mammal selected from the group consisting of 
mice, rabbits, sheep, goats, pigs and cattle. 

14. The method of claim 1 3 further comprising testing the animals for 
production of human serum albumin in the milk of lactating females and mating 

20 the animals containing the highest level of human serum albumin in the milk. 
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